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ON AUTOREGRESSIVE TIME SERIES 
By M. G. KENDALL 


1. The product-moment correlation coefficients obtained by correlating the members of 
a time-series among themselves provide a useful method of investigating the behaviour of 
the series, especially in regard to oscillatory movements. Consider a series of values 2, ... 2, 
measured about their mean, trend-free and defined at equal intervals of time ¢ = 1,...,n. 
The kth serial correlation 7, is defined as the correlation between members of the series k 


intervals apart, iil 

x LjLiip 

— 

fee ee | +* (1) 
(= 29 = ats] 


j=1 j=l 





The figure obtained by plotting r, as ordinate against k as abscissa and joining each point 
to the next is known as the correlation diagram or correlogram. Since ry = 1 the correlogram 
always starts from the point (0, 1). When the series is infinite in extent, the serial correlations 


become auto-correlations (denoted by a Greek p), and the series for which such correlations 
do not vanish may be said to be autocorrelated. 


2. If the time series is random the correlogram presents no systematic appearance. If 
it consists of a simple harmonic the correlogram reproduces that harmonic. If it results from 
a moving average of finite extent d of a random series the correlogram will, within sampling 
limits, be zero after k = d. The correlogram thus provides a criterion for distinguishing 
between various kinds of oscillatory time series. In this paper I propose to consider the 
correlogram of a series defined by the difference equation 


Upyg + AMy,5 + bu, = E49, (2) 


where a and 6 are constants and ¢,,, is a random variable. This equation I shall call the 
generating equation and the coefficients a and 6 generating coefficients. The generated 
seriés may be said to be autoregressive. The notion of generating series in this way was 
introduced by Yule (1927) in a classical paper on sunspot periodicities and has been applied 
to meteorological and economic series with some success. The autoregressive scheme can, 
in fact, explain a typical phenomenon of such series for which Fourier and poriodogram 
analysis cannot satisfactorily account without great artificiality, namely, the continual 
shift in phase and variation of amplitude which occur even when the series is smooth. 


Accounts of the autoregressive scheme have been given in the books by Wold (1938) and 
Davis (1941). 


3. The equation (2) may be solved by the ordinary methods appropriate to difference 
equations. If the roots of 


f&+ab+b=0 
are a+78, a—if, the complementary function of (2) is 


p'(A cos Gt + Bsin @t), (3) 


‘AL ; 
where p = +,/b, 0= arctan’ = arctan | (3 _ i), and A and B are arbitrary constants. 
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106 On autoregressive time series 


It is here assumed that 6 is positive and that 4b >a. It is also to be assumed that ./b = pis 
not greater than unity. The complementary function (3) then represents a damped harmonic. 
I shall call 27/@ the fundamental period of the generated system. 

Let &, be a particular value of (3) such that 


Eo = 0, é, = 1, (4) 
‘ 2 ; 
i.e. such that &, = ap? a) p'sin Gt. (5) 
Then a particular integral of (2) will be found to be 
zs; €1_-j41° (6) 
I= 
The complete solution is 
u, = p'(A cos 6+ Bsin 6t)+ > F554. (7) 
: ot 


In practical cases we may assume that the series was ‘started up’ some time ago, so that 
the complementary function has been damped out of existence. The series is then given by 


tt, = = £56341" (8) 
j= 


This is a moving sum of a random series with damped harmonic weights. It has been gener- 


ally assumed, apparently on the basis of experimental evidence, that the mean period of the ' 


generated series will be 27/0, the same as the fundamental period present in the term £. 
This, however, is not necessarily so. Something depends on what we call a period in the 
generated series, whether, for example, we decide to exclude small ripples on the main wave. 
One possibility is to define the period as the distance between successive ‘upcrosses’, i.e. 
points where the series vhanges sign from negative to positive. The mean distance between 
major peaks or major troughs will not, probably, be very different from this in the majority 
of practical cases.* 

Some idea of the mean length of the period as so defined can be obtained for particular 
distributions of ¢, though a general discussion presents great difficulties. Consider the sum, 
which will be required again later, 











> 4 a a , 2p* ; P 
babe = yar sin 0) sin 0(j + k)} = jgh iP ete — ons O( 27 + k)}] 
_ — 2pk ( 6k _cos0k—p* cos O(k — 2) (9 
~ 4p?—a?\|1—p? 1 — 2p? cos 26 + p4 } 
2pcos0 —a 
We have JE 5 E545/2E? = lee "Tas" cos¢, say. (10) 


Now if ¢ is normally distributed, the mean period (from one upcross to the next) is 27/¢ 
—-a = —a 
2p = 2./b 
* Dodd (1939) points out that in series generated by moving averages of random series there sometimes 
occur oscillations above or below the z-axis which would not be taken into account by counting upcrosses. The 


(Dodd, 1939). Also it is easily seen that cos@ = Thus the period given by the 


difficulty is to decide which of these is not to be ignored es a ‘ripple’. I think that for harmonic weights such © 


as are given by the autoregressive scheme the method of upcrosses is satisfactory, as indicated in paragraph 4. 
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generating equation is not in general the same as the period in the generated series, defined 
as the mean difference between upcrosses. The ratio of the first to the second is 


a 
arc COS +b 
2 Bb 

4. One would expect differences of a similar kind when the variation of ¢ is not normal. 
Experiments indicate, for example, that rectangular variation of ¢ gives much the same 
periods as normal variation. It does not seem to have been previously remarked that the 
mean period of the generated series is not that of the fundamental, but part of the explanation 
is no doubt due to the fact that for ranges of 6 encountered in practice (say 0-5 <b < 1-0) the 
value of the ratio (11) is not very different from unity. In the extreme case b = 1 (when the 
series becomes undamped) the ratio is exactly unity. If 6 = 0-5, a = —1, the ratio is 1-07. 
Notwithstanding the theoretical difference exhibited by equation (11), I think we may take 
it that for most practical purposes a good estimate of the observed period (upcross to 
upcross) is that given by the generated equation, if it is known. Below I give examples of 
two artificial series of the type of equation (2) which support this conclusion. 


(ll) 


arc COs ——; 


5. Equation (9) provides one further interesting item of information, namely, the rela- 
tionship between the variance of ¢ and the variance of the generated series. We have 


— _ 
varu = lim — E (Efe. sa) = > évare, 
j=0 


n—>o NMt=1 


cross-product terms in € vanishing since it is a random variable: 














vart _ se __ ? 1 1—p?cos20 |  _—1+b 
vare ~~ 4p?—a?\l—p? 1—2p%cos26+p4)/ (1—b){(1+b)?—a} 
var é 
a ee an 12 
Thus —— = a+ oy a*}. (12) 
In an actual series of 65 terms (referred to in detail below) with a = —1-1, 6 = 0-5, the 


observed ratio of variances was 0-43. The value given by (12) is 0-35. In another series of 
the same length for which a = — 1-5, b = 0-9, the observed ratio was 0-04, the value given by 
(12) was 0-07. It is difficult to judge how good the agreement is, but to me it appears satis- 
factory for such short series. It is noticeable that the generated series may have a very much 
larger variance than the random series on which it is based. 


6. Consider now the autocorrelations of the generated series. We have 


COV (U;U;,4) 


si = 
lim ~ {> (F;€-341) x (E;€:+n-j41)} = Vare Pa bik 
3 3 = 


, 2p* {(cos@k cos0k—p* cos O(k—2) 
var dp? —a*\1— p ieee 


: : ie . k-2 
Bence pp 3 Se eR 3) - Se ot ee 


(1 +p 2) (1 —cos 20) 


_ P*{sin ( (k+1)0-—p sin (k—1)9} 
(1+ p*)sin@ 
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l 2 
Writing tan y = 2 4 5 tan 8, 
_ pr(l—2p*cos20+pt}t . 1, sin (k0+ p) 
we have Pp, = — (+pysind sin (kO+y) =p ao (13) 


Apart from the constant factor, p, is thus the product of the damping factor p* and a 
harmonic term which has the fundamental period of the generating equation. It is note- 
worthy that, although the point p, = 1 is a peak at the beginning of the correlogram the 
existence of the phase angle ~ implies that the interval from k = 0 to the next maximum 
of the correlogram is not equal to the fundamental period. In judging the length of the 
period from the correlogram it is therefore better to measure from upcross to upcross or 
from trough to trough; or, if peaks are preferred, not to count the maximum at k = 0 as 
a peak. 


The same result may be obtained in two other ways. Multiplying equation (2) by u,_; and 


summing for all ¢ we have Z (E02 %_x) 
— ee eee 


ota +bp, = 
Piet Uri + OP; vere 


Since w,_, depends only on ¢,_;, and terms with lower subscripts, the expression on the right 
vanishes if & is not less than —1. We then have 


Prr2+ Oxi + bp, = 9 (k>—1). (14) 
This result is due to Walker (1931). It was pointed out by Wold that if we multiply (2) by 


Ui+n42 and sum we get 


~(E142Ut+K+2) 


a b = 
P+ OP K41 t+ 9PK+2 cnet 


The expression on the right no longer vanishes. In fact w,,,,. contains the term &,. 16.» 
and we have 


vareé 
Pr+Pps1t OPpre = ——— Sear (kK> —1). (15) 
If k = —1 the two equations become identical, for p_, = p, and we have 


a 
p,(1+b)+a =0, he (16) 
If we now solve either of the difference equations (14) and (15), making use of the initial 
conditions py = 1 and (16) we arrive back at equation (13). 


7. The foregoing result (13) would lead us to suppose that the correlogram of an auto- 
regressive series would be damped according to the factor p*, and this is true for an infinite 
series. In a number of practical cases, however, I was puzzled by the fact that correlograms 
of series which appeared on the face of it to be of type (2) did not damp out in the required 
way. Figs. 1 and 2 show the correlograms of two series for wheat prices and sheep population. 
The original series were taken from the agricultural returns for England and Wales, trend 
eliminated by a nine-years moving average and the first thirty serial correlations computed 
for the resulting series of 64 or 65 terms. The data and the correlations for what prices are 
given in Tables 1 and 2; those for sheep have been given in a previous paper (Kendall, 1941). 
I found a similar effect as regards non-damping in nine other agricultural time series, though 
the correlograms were not so regular as in these two cases. It was always possible, however, 
that the failure of the fluctuations to damp out in the correlogram was due wholly or partly 
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Fig. 2. Correlogram of the sheep population data referred to in paragraph 7. 
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Correlogram of the wheat price data of Tables 1 and 2. 
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to failure on the part of the original series to conform to the scheme of equation (2). To avoid 
such complications I constructed an artificial series with equation 

Uppe = Ley.) —O-5uy+ &,9. (17) 
The e’s were taken from tables of random numbers and consisted of positive or negative 
numbers ranging by units from —9-5 to +9-5. The series was ‘started up’ from zero by 


Table 1. Wheat prices, 1871-1934 inclusive, deviations from 
nine-years moving average in pence per hundredweight 




















Year Price Year Price Year Price 
1871 — 5 1893 6 1915 — 3 
2 -13 4 18 6 -— 7 
3 —21 5 15 7 —40 
4 — 6 6. 1 8 —28 
5 19 7 -11 9 —27 
6 ll & -17 1920 —65l1 
7 —20 9 5 1 —14 
8 3 1900 3 2 37 
9 8 1 4 3 42 
1880 2 2 2 4 ll 
1 — 6 3 3 5 -— 9 
2 —11 4 - 1 6 -17 
3 -— 7 5 2 7 -—15 
4 6 6 5 8 -— 5 
5 9 7 0 9 —10 
6 9 8 -3 1930 -2 
7 4 9 -— 9 l 19 
8 3 1910 1 2 14 
9 4 1 9 3 21 
1890 - 3 2 ll 4 19 
1 —20 3 32 
2 | — 5 4 | 33 














Table 2. Serial correlations of the wheat price series of Table 1 





























| 
Order of Order of Order of 
correlation J correlation . correlation ¥ 
1 +0:577 ll +0-171 21 | +0115 
2 + 0-025 12 —0-075 22 —0-144 
3 — 0-267 13 — 0-302 23 —0°204 
4 — 0-402 14 — 0-332 24 —0-195 
5 — 0-389 15 —0:317 25 —0-221 
6 —0-340 16 — 0-282 26 —0-052 
7 —0-174 17 —0-049 27 +0-217 
S +0-166 18 +0-200 28 +0:404 
9 +0-411 19 +0-360 29 + 0-364 
10 +0:372 20 | +0343 30 | +0024 








assuming U») = u_, = 0. The effect of assuming two consecutive terms equal to zero is not 
important, for the contribution to a term of the series of terms far back is very small. We are 
therefore entitled to expect that an artificial series constructed in this way should conform 
to the foregoing theory. The resultant series and the serial correlations are given in Tables 3 
and 4 and the correlogram in Fig. 3. Now this series is heavily damped, p being (0-5) = 0-7071. 
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At the twentieth serial correlation, according to equation (14), rs) should be less than 0-002 
in absolute magnitude. Actmally it is one hundred times as big. 


8. The explanation lies, I think, in the shortness of the series. In arriving at equation (13) 
it was assumed that product sums such as Le,¢;,, were zero; and this is so for a series of 
infinite length. But when the series is short these sums may differ quite appreciably from 


Table 3. Artificial series w,. = 1-1u,,, —0-5u,+ €,2 constructed as described in the text 








l ] 

No. of Value of No. of Value of No. of Value of 

term series term series term series 
1 7 23 — 4 45 —13 
2 6 24 — 5 46 1 
3 — 6 25 -— 9 47 6 
4 —4 26 —4 48 4 
5 3 27 — 4 49 ll 
6 — 4 28 3 50 15 
7 — 5 29 9 51 9 
8 - 1 30 4 52 8 
at) 10 31 - 8 53 4 
10 10 32 — 6 54 - 1 
ll 6 33 — 3 55 4 
12 — 4 34 — 2 56 7 
13 -—4 35 0 57 ll 
14 — 7 36 - 1 58 0 
15 - 2 37 - 3 59 1 
16 6 38 3 60 0 
17 17 39 -—1 61 -— 5 
18 24 40 - 8 62 -11 
19 17 4l -— 3 63 -— 8 
20 4 42 — 8 64 —- 3 
21 1 43 —10 65 5 
22 — 5 44 —16 


























Table 4. Serial correlations of the series of Table 3 


























Order of Order of Order of | 
correlation . correlation | . correlation “ 

| 

1 +0-70 ll | —0-05 21 +0-05 

2 +0-29 2 | -017 22 —0-12 

3 +0-01 13 — 0-27 23 —0-28 

4 —0:17 14 | —0:31 24 —0-43 

5 —0-27 15 —0-30 25 —0-57 

6 —0-25 16 —0:18 26 —0-56 

7 —0-13 17 +012 27 —0-26 

8 +0-07 18 +0-29 28 +0-02 

9 +0-12 19 +0-33 29 +0-17 

10 +0-05 20 +0-22 30 +0-27 





zero. For instance, in samples from a normal population with zero correlation, the standard 
error of the correlation in samples of 65 is about 0-125, so that values of 0-25 are not very 
improbable and even values as high as 0-375 are not impossible. 

Consider then the series of n terms such as 


U; = £6; + e6;_1 + Es6;at...- 
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The product moment of u,; and u;,;, will be 
x {S22 = > x £65141 >> Em €j+k—m+1 = 22 £:5mej—141 €34k—m+1" 
j jl m jim 


The terms in e? give the expression (13). The others will be sums of products of the &’s 
multiplied by the serial covariances of the e’s themselves. The dominating terms in these 
latter will be those containing the larger £’s. These terms are themselves damped harmonics 
of the fundamental type and when applied to the sums Ze;e;,, may be expected to generate 
oscillatory movements of about the same period as the original series. We may therefore 
expect that for short series the correlogram of the autoregressive system may not decay 
very rapidly, but that the product terms may themselves result in a small fluctuation. This 
appears to be happening in both the practical and the artificial examples given above. 

















Values of r, 


5 | 15 20 25 30 
iit Values of k / 



































Fig. 3. Correlogram of the artificial series of Tables 3 and 4. 


9. Putting aside the mathematics for a moment and looking at the point in a general way, 
we can, [ think, appreciate that something of the kind is to be expected in short series. The 
variance of a short series should not differ systematically from the variance of the infinite 
series; but the covariances may systematically exceed in absolyte value the value for the 
infinite series. In fact, as we proceed along the series, the oscillations change in phase, and 
when we have gone far enough will be quite unrelated in phase to the initial oscillations; but 
if we only go so far as the second or third oscillation the last oscillation may not, so to speak, 
have had time to get very much out of phase with the first. The consequence will be that the 
correlations for such a short space of time will tend to be higher than those for the series as 
a whole. For the generated system (and indeed for time series in general) we have to be 
careful not to assume too lightly that values calculated for a part of the series are typical 
of the corresponding values for the series as a whole; and this notwithstanding that the part 
of the series was chosen ‘at random’. 
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One would like to know how long a practical series must be for the damping to show itself 
decisively. An exact discussion of this point presents difficulty, because in a finite series the 
serial covariances 2¢;€;,, are not only non-vanishing but are correlated among themselves. 
Suppose, however, that we have a series with a damping factor p. Then the kth serial co- 
efficient will not exceed p* in absolute value, and the difference of the correlogram as a whole 
from expectation will not at any point exceed the error due to the sampling fluctuation of 
the serial correlation between u; and u;,,. This in turn will not exceed the sampling error of 
the correlation in samples of n from an uncorrelated series, which for large samples has a 
standard error n+. For example, if we took a series of 650 terms instead of 65, correlations 
up to the 30th would have a standard error not exceeding 1/,/(650—30) = 0-04, and hence 
fluctuations of 0-10 would not be impossible. But by the 30th term in the correlogram the 
serial correlation has been practically damped out of existence. The fair inference is, I think, 
that except for the first few serial correlations of the main series, which are not damped 
very much, the serial correlations are seriously affected ever for long series, for the sampling 
errors are not negligible compared with the smail damped ‘true’ values. In general one 
would expect the damping effect to show itself for the first five, ten or twenty terms and then 
to be obliterated by the sampling effect. Whether this happens at the fifth, tenth or twentieth 
term depends not only on ¢ but on p, the rapidity of damping. In most of the natural series 
I have examined the damping is fairly rapid, so that the damping effect in the correlogram 
disappeared after the first few terms. The economic examples given by Wold (1938) and the 
meteorological examples of Walker (1931) appear to me to support this conclusion. 


10. I turn now to consider another effect which may obscure the presence of the generated 
system of type (2) and may exert an important influence on the correlogram. The random 
element € considered up to this point is what Yule calls a disturbance, and is integrated into 
the course of the series by the autoregressive scheme. There may also be a component 7 
superposed on the system. This superposed element, if random, is like an error of observation 
in that its value at any point is unrelated to its value at other points. 

If a random element with variance var » is superposed on an infinite series with variance 
var u the variance of the whole will be vary + var u. The autocovariances, however, will not 


be affected (except by sampling effects for short series). Consequently all the autocorrelations 
except py will be reduced in the ratio 


ES... aoe (18) 


For short series there will still be a reduction of the same type, but the value of c may differ 
from its theoretical value for sampling reasons. 


11. An autoregressive series of 65 terms was constructed according to the formula 
Uppe = 1-5uy,.—O-9u,+ €,9, (19) 
where the ¢,,. were the values of random numbers proceeding by units from — 49-5 to 49-5 
with a theoretical variance of 833. On to the series so derived there were superposed (a) a 
rectangular random element — 49-5(1) 49-5 and (4) a further rectangular random element 
— 199-5(1) 199-5, additional to the first, the latter series then being divided by ten and 
rounded up to the nearest integer. The resultant series and serial correlations are shown 
in Tables 5 and 6, and the correlograms in Fig. 4. 
According to (12) the variance of u for an infinite series generated by (19) should be 
13-97 var e. The values of c for the two series here considered are then, respectively, 
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13-97/14-963 = 0-93 and 13-97/30-963 = 0-45. In the second case the effect of the super- 12. 
posed variation is to halve the observed correlations. The effect on the actual series of and re: 
65 terms is somewhat irregular (see para. 14 below). adopte 
of the 
Table 5. Artificial series u,,. = 1-5u,,, —0-9u,+ €,9, (a) with small superposed element 7, usual 
(b) with large element 9, constructed as described in the teat series | 
that fe 
| 
Value of series Value of series Value of series 
| No. of No. of No. of 
term term term 
(a) (6) (2) () (2) (0) The o 
that v 
1 5 16 23 —215 —34 45 88 3 
2 36 7 24 —219 —15 46 76 - 5 
3 8 10 25 — 76 -—17 47 93 21 
4 - $1 -— 5 26 . 95 —4 48 34 8 
5 — 89 7 27 239 15 49 60 15 
6 - 15 - 7 28 316 23 50 —- 69 | -—13 
7 35 10 29 289 21 51 —120 | —10 
5 112 13 30 169 13 52 - 5 | 12 
9 146 17 31 49 -— 7 53 — 56 -—13 
10 100 19 32 —114 — 26 54 12 7 
ll 1 7 33 — 259 — 26 55 5 -ll 
12 —131 1 34 — 290 — 26 56 13 —18 
13 — 195 — 6 35 — 208 —29 57 3 - 1 
14 — 259 —27 36 — 31 -—4 58 13 ll 
15 — 258 —37 37 21 12 59 -— 4 9 
16 -—118 —28 38 109 18 60 -— 71 -—13 
17 32 6 39 26 19 61 — 94 - 7 
18 110 - 3 40 33 -— 5 62 | 15 19 
19 245 6 41 22 0 63 | 45 - 3 
20 166 24 42 — 30 ~10 64 | = 138 19 
21 79 15 43 51 -—15 65 115} 0 
22 -177 —34 44 49 17 | 
| 
Table 6. Serial correlations of series (a) and (b) of Table 5 
| | 
| r r r 
Order | Order Order | 
of cor- | of cor- of cor- 
relation | (a) (b) relation (a) (b) relation | (a) | (b) 
—— % | 
1 +0-78 +0-49 11 +0-33 +0-16 21 | +021 | 40-05 Fi 
2 +033 | = =+0:13 12 —0-04 —0:16 22 | -007 | —0-28 
3 -—0-22 | -013 13 —0:38 —0-37 23 | -028 | —0-33 
4 -063 | —0-42 14 —0-58 —0-50 24 | —0-32 —0-29 
5 —0-75 —0-46 15 —0-52 —0:36 25 —0-22 — 0-20 Fro 
6 —0-59 —0:39 16 —0-23 —0-14 26 | -001 | -0-07 
7 —0-22 —0-01 17 +0-14 +0-19 27 +016 | +011 the 
8 +019 +0-28 18 +0-43 +0-42 28 | +023 | +0-20 oun 
9 +0-48 +0-38 19 +0-55 +0-41 29 | =+013 | +001 - se 
10 +0-53 +0-52 20 +0°45 +0-37 30 | —0-11 | -0-11 wil 
c is 
The correlograms run according to expectation. The effect of the bigger random element 


is to reduce the amplitude at the beginning of the series and to introduce some minor irregu- 
larities in the data, but not to affect substantially the lengths of the correlogram oscillations. 
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12. But here arises one important difficulty. Suppose we are given such series as these 
and requirs to estimate a and b, the constants of the generating equation. The procedure 
adopted by Yule was as follows: we take the observed serial correlations r, and r, as estimates 
of the autocorrelations. We then find the regression equation of u,,. on u,,, and u, by the 
usual methods, assuming that the variance of the series is the same as the variance for the 
series less its first term and that the serial correlation r, for the whole series is the same as 
that for the series less its first term. The regression equation is 


r,(1—r.) r.—?7? 
U2 = =" w t+1 — 7 tte (20) 





The observed regression equation is then taken as an estimate of the generating equation so 
that we have as estimates of a and b 


a —r2 ‘a 
te 2S Te) yes Tr] 12 re (21) 


1—r ’ ~ l=-rn 1-r 























Values of r, 



































—1-0 
Fig. 4. Correlogram of the two artificial series of Tables 5 and 6, the full line representing series (a) with 
slight superposed variation, the broken line series (6) with large superposed variation. 


From the above it will be evident that there are two sources of possible error in the use of 
these equations: (1) if the series is short r, and r, may not be reliable estimates of the auto- 
correlations for the infinite series, (2) if there is any superposed variation the observed r’s 
will be lower than the true r’s for the autoregressive system, being in fact cr, and cr, where 
c is given by (18). Consider the second of these effects. 


13. In the case of superposed variation the use of (21) will lead to the equations 


1—c3? ’ ~ ler 





ve cr,(1—cr,) ,  Crg—c*r? 
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The estimated fundamental period of the generating equation is then given by 


9 


a’? cr?(1 — cre) 
‘(1 =e? r?) (vr, — er?) 





If we expand (22) in powers of y = 1 —c we find, to first order in y, 


a’? a®{_ | (1+6)(3b®@—b—a2)| 


bo bl Ube) 





Hence, if 3b7—b—a?>0 the effect of a superposed variation (equivalent to positive y) is 
to give a2 a 


wy < 5° 
or in other words to result in a shortening of the observed period. The condition that 
3b? — b—a*?>0 is equivalent to 
b> H{-—14.,/(12a? + 1)} 


and is not very restrictive since in any case a? <4 and 4b >a®. The inequality is obeyed by 
all the examples I have met in practice. 

We therefore reach the interesting conclusion that if there is any superposed random 
variation present, the period calculated from the observed regression equation according to 
formulae (21) will probably be too short even for long series. Yule himself found too short 
a period for his sunspot material and, suspecting that it was due to superposed variation, 
attempted to reduce that variation by graduation. The result was to give a longer period 
more in accordance with observation. It does not appear, however, that the superposed 
variation in his case was very big. In a number of agricultural time series which I have 
examined it is sometimes about half the variation of the series and the effect on the period 
as calculated from the serial correlations is very serious. For instance, in the cases of 
wheat prices and sheep population referred to above, formulae (21) give periods of 7-0 


and 6-8 years, whereas the correlograms indicate periods of about 9-5 and 8-5 years 
respectively. 


14. As an example, consider the second of the artificial series referred to in paragraph 11. 
For the observed serial correlations I find 


r = 0-486, r, = 0-133, 
giving, according to (21), 
—a’ = 0-552, b’ = 0-135, cos6’ = —— 


56 = 0-751, 6’ = 41:3°, 


which corresponds to a period of about 8-7 years, whereas we know from the construction 
of the series that 


a=15, b6=0-9, 
giving 0 = 37-7° and a period of 9-5 years. 

Considering the profound effect which the superposed variation has had on the first two 
serial coefficients, reducing r, from 0-78 to 0-49 and r, from 0-33 to 0-13, one might have 
expected the period to have been affected even more than appears from this result. But the 
example serves to bring out once again the difficulties associated with short series and the 
unreliability of coefficients calculated from the first two serial correlations in such cases. 
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If, for instance, we had found r, = 0-18 instead of 0-13 we should have obtained a period of 
about 12 years, and if r, = 0-20 the solution becomes impossible, for a’ and 6’ then assume 
values such that a’*> 4b’ and cos@’>1. Furthermore, such a value as 0-18 for r, increases 
the period instead of decreasing it. We note, in fact, that r, has been reduced by 
0-13/0-33 = 0-40 so that it is not legitimate to assume that r, and r, are reduced by a con- 
stant c. The errors introduced by neglect of the autocovariances of the superposed random 
element may be so serious as to destroy the value of calculations based on the observed 
regression coefficients. 


15. Even in long series, when it is legitimate to suppose that r, and r, are reduced in the 
same proportion, the length of the period is very sensitive to superposed variation. Consider, 
for example, the effect of small variations in c near c = 1. We have from (22), differentiating 
logarithmically and putting c = 1, 











dé 2r, 2r2 r? 
a 2 ey wes 2 1 1 
aborts , l—rg 1-7? r.—1?’ 
hich on substitutin pe wae 
whicn on §s £ i= 1+8° %~=-— i335 
d@__(1+b)(3b?+b—a?) 
reduces to tan 0 = “i +b)—a5 (23) 
: 4b 
Now the period P=27/6 and tan@ - JG - 1) ; 
a 
dP ~—s P*a(1 +b) (3b?+6b—a?*) , 
Hence —* ~ dr (4b—a*) {(1 +6)? —a?} (24) 
When a = —1-5,b = 0-9, P = 9-5, this reduces to 
dP 


—* 14 intervals. 
Thus if c = 0-9, i.e. the superposed variation is only about 10 % of the total, the period may 
be shortened by something of the order of one interval. 


16. The position may then be summarized as follows: 

(a) The correlogram of a generated system of the type of equation (2) will be damped 
according to the damping factor of the equation; but if the series is short the damping may 
be considerably less than the theoretical value. 

(6) The correlogram will show a period equal, within limits of error, to that of the funda- 
mental period of the system. The distance from the unit ordinate at k = 0 and the first 
maximum of the correlogram may not, however, be a full period and in estimating periods 
from the correlogram it is better to reckon from upcross to upcross. 

(c) It does not appear that in short series the periodic movement is substantially affected, 
at any rate not to such an extent as the damping. | 

(2) When superposed random variation is present the fundamental period calculated 
from the observed regressions will be too short and may be very considerably so. 

(e) The period of the generated series, defined as the mean distance from upcross to 
upcross, is not quite the same as the fundamental period: but the difference is not likely to 
be important in practice. 
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17. Armed with these results we may consider the difficult problem of determining, for 
a given time series, the autoregressive scheme which may have generated it. 

The first step is to calculate the serial correlations and examine the correlogram. If the 
latter shows fairly regular oscillations there is a presumption that the series is of the auto- 
regressive type, and this conclusion need not be rejected solely because the oscillations do 
not damp so rapidly in the later portion of the correlogram as at the beginning. To take the 
wheat price material given above, an examination of the correlogram (Fig. 1) strongly 
suggests a simple damped harmonic. The sheep series (Fig. 2) is not so clearly defined and 
there is a suggestion of more than one period in the behaviour of the correlogram. This may, 
however, be due to the shortness of the series and one would be inclined to consider it in the 
first place as a single damped harmonic. 

The length of the period, as pointed out above, is best determined by the mean distances 
between upcrosses. In the wheat price data there are upcrosses at about 7-5 years, 17-2 years 
and 26-1 years, giving periods of 9-7 and 8-9 years with a mean of 9-3 years. Much the same 
result is arrived at by counting the periods between troughs. 


18. The second step is to calculate a’ and b’ by equations (21) and to find the period of the 
fundamental harmonic as given by the observed regression equation. For wheat prices we 


have r, = 0-5773, ri, = 0-0246. 
Hence a’ = —0°8446, 6’ = 0-4630, 
whence cos 7’ = 0-6206, 6’ = 51-63°, 


giving a fundamental period of 6-97 years. 

This is too small, and we are immediately led to suspect the existence of superposed varia- 
tion. The problem then arises of determining the variance of the superposed element. If 
this is random, and there are no periodic terms of very short period the variate difference 
method may be used. For the wheat price material, taking differences up to the 10th on the 
primary series (before trend was eliminated) I find for an estimate of the random variance 
var 9 = 27-72. The total variance of the series is 272-8. The constant c of equation (18) is 

27:72 
1— 272-8 = 0-90. 
The putative serial correlations of the autoregressive series will then be 
r, = 0-641, 7, = 0-027 
and a=—1-059, b= 0-652, 
whence cos@ = 0-6551, 6 = 49-07°, 
giving a period of 7-34 years. 

This is still too short. It would require a random superposed variance of about 25 % of 

the total, instead of the observed 10 %, to produce a period of 9-3 years. 


19. This illustrates very well a constantly recurring difficulty in the theory of the variate 
difference method and of time series generally. The superposed variation may not be random. 
Indeed, we have little ground for expecting that it should be. A positive correlation between 
the successive values of 7 will reduce the variance shown as random by the variate difference 
method and unless we have prior reason to suppose that 7 is random the values given by the 
variate difference method are quite likely to be too small. Unfortunately we rarely have any 


prior knowledge of 7, but from general economic considerations one would not be surprised | 
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to find that there do exist positive correlations from one year to the next, owing to the 
enduring nature of some of the causes which can give rise to superposed variation. I conclude 
generally that discrepancies of the type here considered support the view that the period is 
to be determined from the correlogram, not from solution of the regression equation. 


20. Two points may be mentioned incidentally. One, a matter of technique, is that the 
arithmetic of serial correlations and variate differences are closely linked together and the 
results of the one can be used to derive those of the other. This means an enormous saving in 
arithmetic and the method is described in the Appendix to this paper. 

The second point concerns the removal of superposed random variation by graduation 
formulae. It will be clear that if the superposed variation is not random graduation may 
only make matters worse; but even if it is random, graduation formulae may induce spurious 


cyclical effects into the data. It seems to me that as a general rule graduation is to be under- 
taken with great caution. 


21. Reverting to the main topic, it would seem that if the period shown by the correlo- 
gram and the period calculated from the observed regression equation disagree, and cannot 
be reconciled by the assumption of a superposed random element, there is little further to 
be done to dissect the superposed element from the autoregressive part of the system. If, 
however, the variate difference method supplies a variance of 7 which can satisfactorily 
explain the difference in periods, we may go forward. The constant c can be calculated and 
the constants of the autoregressive part of the system determined. Equation (12) then gives 
an estimate of the ratio between the variance of the basic random element ¢ and the variance 
of the generated series. If there is a superposed element it will not be possible to find the 
values of that element at every point and so to determine ¢ at every point; but if 7 is non- 
existent, € can be explicitly determined, except for the first two terms of the series. In fact, 
we merely apply the generating equation to the observed series. the residuals between 
prediction and observation being the values of ¢. An examination of these residuals will 
confirm whether they may be regarded as a random series. 


22. The foregoing treatment can be extended to the case of a more general linear regres- 


sive system Ss 
: Utsm + Oy Upp m—1 t ++» Fm Uy = Etpm 


or to cases where the regression is curvilinear; and the necessity for more general schemes in 
representing observed data can, as shown by Yule, be discussed in terms of partial auto- 
correlations and scatter diagrams. A more serious problem arises if the series ¢ is itself not 
random, a state of affairs which one fears might be fairly common in economic series. To take 
the wheat price data once again, it would not be surprising to find that the wheat price 
oscillations were regenerated by a series of disturbances, part of which were attributable to 
variations in acreages, yields, or the prices of other crops. Such disturbances might themselves 
be oscillatory. For such cases the problem becomes exceedingly complicated. To discuss it 
at all satisfactorily one would require a long series or collateral evidence in the form of other 
series of a similar character. If there is a royal road in this subject it has not yet been dis- 
covered. 
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APPENDIX 
Relationship between variate differences and serial correlations 


If we have a series of values z, ... x, the first differences are x, — 22, etc., the second differences 


2, — 2%, +23, etc., and so on. Let S; be the sum of the squares of the jth differences and write 
for the product-sums 


n-j 
i= la (25) 


The sums S are those appearing in variate difference analysis and the quantities P appear 


in serial correlation analysis. Either set can be expressed in terms of members of the other, 
as follows: 


We have . & =f, (26) 
n—1 n-1 n 

8, => (x; — 254)? = Ph 2 2 Xj Fi4a+ PEG - 2P,—2z3-2x3,- 2P,, (27) 
i= j= j= 


n—2 n—-1 n n—2 n—2 a—l 
S, = D (&j— 2% 5,1 +% 540) = 2% + 423 - 2 xj ~ 4 espa +2 2 ts hjo2 — 4D Ujtp41 
j= i= = = - 


j= 
= 6F,— 8P, + 2P,— a} — xy, — (2a, — %_)?— (22, — 2,4)? (28) 
and so on. For the purpose of expressing the general formulae of this kind it is convenient 


to modify the sums S. Suppose we write the series x, ... z,, preceded and followed by a number 
of zeros. The difference table will then appear as follows: 


0 
0 
0 0 
0 —% 
0 +2, 
—%, 32, — Xe etc. 
Ly —2%,+2, 
Ly —Xq — 3%, + 32%,—23 
Xq Ly — 2%_+Xz 
L_—Xs XL — 3%, + 3x3 — 2, 
Xz X_— 2%3+ 2, 


with a symmetrical effect at the other end. Writing now 7,, 7), etc., for the sum of the squares 
of members in the first, second, etc., column of differences we see that 


T; = ~ {x ba (?) Teta + (3) Leta: \ , (29) 


where the summation now takes place over all values of x and there are no com- 
plications introduced by end effects. In fact, we have thrown the end effects into the 
sums 7' which replace the S’s. In actually calculating the 7’s from the S’s it is very little 
trouble to add the extra terms to the tables giving the latter; and when calculating the S’s 
from the T’s only the differences at the end of the table need be worked out. 
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We have then from (29), on expansion 


= Fl(t) +() +~--2-ore [Al ()+-+(2)@]} 6 


The coefficients of the various P’s are easily seen to be equal to corresponding powers of ¢ in 


(e-(-Qe-e-Gers-} 


ie. in (— 1)/(1—#)”’, and we find, on substitution in (30), 


T, = R(?)- 2A,( 7 )+2R(,7, ») +... +2(- 1B, (31) 
For example 
T, = fi, 
T, = 2P,—2P, 
T, = 6P,— 8P,+2P,, 
T, = 20P,—30P, + 12P, —2P,, 
T, = 70P)—112P, + 56P,— 16P, + 2P,, 
T, = 252P, — 420P, + 240P, — 90P, + 20P, — 2P,, 
T; 
T; 


924P, — 1584P, + 990P, — 440P, + 132P, — 24P, + 2P,, 
= 3432P, — 6006P, + 4004P, — 2002P, + 728P,— 182P, 4 28P,—2P,, 
T, = 12870P, — 22880P, + 16016P, — 8736P, + 3640P, — 1120P, + 240P, —32P, + 2P,, 
T, = 48620P, — 87516P, + 63648P, — 37128P, + 17136P, — 6120P, + 1632P, — 306P, 
+ 36P, — 2P,, 
Ty) = 184756P, — 335920P, + 251940P, — 155040P, + 77520P, — 31008P, + 9690P, 
— 2280P, + 380P, — 40P, + 2P iy. (32) 


The coefficients check in virtue of the fact that they sum to zero. 
Conversely we have 


2P, = 2T,, 

2P, = —7,+2%, 

2P, = T, —4T, + 27), 

2P, = —T, + 6T, — 97, + 27), 

2P, = T,—8T, + 207, — 167, + 27%, 

2P, = —T,+ 107, — 357, + 507, — 257, + 27), 

2P, = T,— 127; + 547, — 1127, + 1057, — 367, + 27%, 

2P, = —T,+ 147, —77T, + 2107, — 2947, + 1967, — 497, + 27, 

T, — 16T, + 1047, — 3527, + 6607’, — 6727, + 3367, — 647, + 27, 

2P, = —T, + 187, — 135T, + 5467, — 12877; + 17827’, — 13867, + 5407, — 817, + 2%, 


2P,, = Ti) — 207, + 170T, — 800T;, + 22757, — 40047, + 42907’, — 26407, + 8257, 
— 1007, + 27). (33) 
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The coefficients sum in turn to 2, 1, —1, —2, —1, 1, 2, ete. The coefficient of T;, in 2P; is 
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COMPARISON OF THE CONCEPTS OF EFFICIENCY 
AND CLOSENESS FOR CONSISTENT ESTIMATES 
OF A PARAMETER 


By R. C. GEARY 


Having given two estimates X and Y of an unknown parameter 0, E. J. G. Pitman (1937) 
has suggested that X should be regarded as a ‘closer’ estimate of 6 if the probability that 


|X-0|<|Y-@l (1) 


is greater than 4. This concept has intuitive appeal and it is in accordance with statistical 
tradition that preference should be expressed on a probability scale. The object of this com- 
munication is to compare ‘closeness’, as defined, with the familiar concept of relative 
‘efficiency’ (Fisher, 1922) as determined by the variances of the two estimates. Continuous 
variaticn is assumed throughout. 

Pitman has shown that the median has a special role in his theory of closeness and, since 
the median is notably unamenable to algebraic treatment, it is not to be expected that, 
despite its apparent simplicity, condition (1) should be readily expressible in terms of the 
semi-invariants (assuming these known firmly or approximately) of the joint distribution 
of X and Y. 

Suppose that X and Y are consistent estimates of 0, i.e. that 


EX = EY =8@. (2) 


In this connexion it must be observed that, in not necessarily large sample theory, this 
condition need not be observed. In fact, Pitman (1937, p. 215) has shown that in estimating 
the variance of normal samples of n, the ‘closest’ estimate of the variance is 


n 
8, = } (x; —%)*/(n— §), approximately, (3) 
=1 


°. 


which is not consistent, instead of the usual 


85 = 3 (%—2)4/(n—1), (4) 
which is. 
If the joint distribution of X and Y is symmetrical, i.e. if 
Aeg = Aji (5) 
for all joint semi-invariants of powers i and j, then it is clear that 
Prob {| X —0|<| Y—6]|} = Prob{| Y—6|<|X-—6}}, 


and, since the sum of the two probabilities is unity, each must be equal to }. It follows that, 
when the two estimates are distributed on the normal surface of error with equal variances, 
the probability of (1) is }, since in this case A;; = A,,;, the semi-invariants being zero for all 
values of i and j except those for which (i +7) = 2, where the sets for (7, j) of (2, 0) and (0, 2) 
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are the variances of X and Y respectively. It can readily be shown that the probability of 
(1) in the normal case, with variances of a? and o? and coefficient of correlation p is 


9 2 

1 — (*22xt et ' 
c 1 2 2 -_ a > 

7 ~~ 


(6) 


which shows that in this case the criteria of closcaess and efficiency are identical, since, for 
all values of p, the probability is greater than, or less than, } according as @, is respectively 
less than, or greater than, 7,. 

Under general conditions the joint distribution of 


x=X-6 and y=Y-@ 
will be of the form 


f(x,y) = Ex {! ais =i i? ’ Ola y) dx dy 7 

ellis isj-3 blo! “\Ox] \oy a ' (7) 
P(x, y) being the normal function . 

: | - l — a 2px y y* | 27. /(1 — p2 ‘ 

exp | x1 H\ 2 tes 2 | 27 (1 —p*). (8) 


If the two estimates are computed from samples of n it is well known that 

Ai; aia} 
is of order }(2—7—j) in n. It is natural, accordingly, to try to find the value of probability 
of (1) when the variates are distributed in (7) assuming that the A;; are zero for all but the 
smaller values of (¢+7). Probability of (1) is given by 


f° + 00 r+ ly! 
| dy | S(x,y) dx. (9) 
-« —lyl 


To find the value of this integral the frequency f(x, y) is regarded as the product of the normal 
function ® and a polynomial in x and y determined from (7). It can easily be shown that 
polynomials of odd order in (¢+j) vanish on integration. Accordingly the integral (9) is 
given by (6) plus a term O(n~"); furthermore the expression for the probability contains only 
terms in n~“, where & is a positive integer. 

There is no theoretical difficulty about the computation of the coefficients of powers of 
A,;/0,0; in (9) but the algebra is complicated. For the purpose of this note it will suffice 
perhaps to state that when 


1=%, Ay = for (t+j)>4 and rE, = 0 for k>2, (¢+j) = 3, 


then the probability (1) is given by 


+ == (I — p®)~#{6(1 — p*) (Ago — Aga) — 12(1 — p?) (Aga — Ais) 
— (7 + 2p*) (Ag§ — Ags) + 36(Agq Aga — Ajo Ags) 
— 3(1 + 2p*) (2Ag9Aje + 3Ag} — 2A3 Ags — 3Aj3)}, (10) 
with Ny = Ay O70}. 


The term in addition to } may be regarded as O(n-), the terms neglected being O(n-?) when 
n is large. When 7 is not large it is evident that the added term may be quite appreciable 
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when Aj; is significantly different from A,;; and more particularly if p is nearly unity, which 
will usually be the case with two estimates which are of almost equal efficiency. In a par- 
ticular example (which, however, had no reference to the problem of estimation) the values 
found were as follows: 


p = 0-953; so = —0°273; Aj, = —0-287; Aj, =—0-302; A‘, = — 0-307; 


‘9 = —0°563; Aj, =—0-513; Aj, = —0-466; AL, =—0-418; A‘, = —0-327. 


For these values of the semi-invariants the value of (10) is 0-5185, which does not differ 
much from the value $ which would be obtained on the assumption of equal variances and 
normal distribution of the estimates; it is equally certain that, when the estimates are com- 
puted from large random samples and when their joint distribution tends towards normality 
as the sample number tends towards infinity, the probability (1) will be exceedingly close 
to 4; there is no good theoretical reason, however, for thinking that for estimates computed 
from samples which are not large, the value of (10) (or of the more exact value of the pro- 
bability which would be found by taking into account further terms in the expansion of (7) 
in the compilation of (9)) would be close to 4, in general. 


I] 


SOME PARTICULAR CASES 


As an application, consider the method of estimating the universal mean of a normal dis- 
tribution, give a random sample of n, assuming that the universal variance is unity. The 
distribution is, accordingly, 
1 
—— e-te-? dz, 
(27) 
Since the maximum likelihood estimate of @ is the mean 
1 n 
A=— > % 


its variance in large samples must be less than that of any other estimate Y of the parameter 
(Fisher, 1922, 1935). It is well-known that the median y of large samples of n from a normal 
universe with mean zero and variance unity is distributed approximately as 


1 (n —n cf 
AL adie 


whereas the mean 2 is distributed as 


a a 
Jk) 2 dz, 


so that the respective variances are ,/(77/2n) and 1/,/n. It has been computed that for large nor- 
mal samples the correlation between the mean and the median is p = ,/(2/7). From formula (6) 
Prob {| X-—@|<| Y—@|} = 0-615. 


. 


Pitman (1937, p. 221) has shown that the closest estimate of the centre of a rectangular 
distribution is the mid-point (or the mean of the largest and smallest members of the sample). 
The rest of this section deals with comparisons, by means of the criteria of closeness and 
efficiency, of the mid-point and other consistent estimates of the centre of the rectangular 
distribution of which the range is known to be unity. 
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It is a well-known fact that by the variance test the mid-point is more efficient than the 
arithmetic mean as an estimate of the middle point of the range for a rectangular universe 
when the range is known. In fact the respective variances are as follows: 


Variance 
Mid-point: 1/{2(m + 1) (n+ 2)} 
Mean: 1/{12n}, 


so that the variance of the mid-point is actually of a lower order of magnitude in n than the 
variance of the mean. The respective distributions (when the centre is zero) are as follows: 


Mid-point: n(1—2|t|)"—dt (11) 
Mean (Gram-Charlier, correct to n-*): 


dy [ 1 (<) 1({1 (<) 1 ()"| 
etd et Mater Bie tbe ed Bene 
(27) 20n\dy} ~ n?\105\dy) * 800\dy 


1( 3 (d\8 1 /(d\ 1 (da)\") adie 
~lraaag) “crate rs, |+..|é vy 


y = x,/(12n), x being the mean of sample of n. 
The distributions of the rectangular mid-point and lowest point (used later) have been given 
by Neyman & Pearson (1928). The £, of the distribution of the mid-point is given by 
Ss 6(n + 1) (w+ 2) 
2™ (n+) (n +4)’ 
which tends towards 6 (instead of the normal value 3) when n tends towards infinity. The 


distribution of the mean tends rapidly towards normality when tends towards infinity. 
Formula (6) cannot be used in this case. Actually it is found that, when n is large, 


6 
y= sow ae 9S 8 +t, 
Prob {| «| <|#]} (a) 1-38” (14) 
It is interesting to compare this probability with the probability which would be found from 


formula (6), i.e. on the assumption that both estimates were normally distributed. The 
coefficient of correlation between mean and mid-point is 


6n 
=> -_ = 6 —t 1 
0= | anees) ™ 
and the pseudo-probability required is approximately 








(13) 








is ap = 1-56n-+, 
7 


which is very little different from the true probability at (14). 

In the previous application one of the two estimates (namely the mean) was approximately 
normally distributed. The mid-point is now compared with another estimate, the variance 
of which is of the same order of magnitude (in ”) as that of the mid-point, but the distribution 
of which does not tend towards normality with increasing n. 

This other estimate is the lowest point in the sample less its mean for = 0. This estimate is 
clearly consistent. Since both estimates are consistent their deviations may be regarded as 
measured from 9, the unknown centre of the rectangular distribution. 
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In the range (— }, + 4) the distribution of the lowest point is given by n(4—1u)"—"du, the 
mean value of which is —k = — }(n—1)/(n+1). The proposed estimate is (u+k). If the 
largest value is v the mid-point estimate is ¢ = }(w+v). Hence it is required to consider the 
probability of jutk|<duto. (16) 
The joint distribution of u and v is given by 

n(n—1)|v—u|"-"*dudv. (17) 
In the (u, v) plane the inequalities (10) define two ‘critical’ straight lines given by 

v—u—2k=0 and v+3u+2k=0. 
The required probability will be found by integrating (17) over certain areas bounded by 
these lines and by the lines u = + }, v = + }. The probability is a very complicated function 
of n which, however, reduces to 
3e-4 — le-2 = 0-317, (18) 

when ” = 0. Accordingly the mid-point is a much closer estimate of 6 than the lowest point 
less its mean for 6=0. 


As regards efficiency, the variances of u and ¢ and the coefficient of correlation between u 
and ¢ are as follows: 


o% = ni/{(n+1)?(n+2)}, oF = 1/{2(n4+1)(n4+2)}, py = V(m+1)/,/(2n), 
so that, if w and ¢ were normally distributed, the probability that |u—u|<!é|, from (6), 
is given by rT 
; tan” (2./(m+ 1)/,/(m — 1)) 


which, when x tends towards infinity, tends towards 0-352. This is not very different from the 
true probability 0-317. 


A FURTHER ILLUSTRATION: ‘CARS IN A TOWN’ 


At a recent meeting of the Dublin University Mathematical Society, E. Schrédinger 
suggested the following ingenious problem as an illustration of Pitman’s concept of closeness. 
In a town, cars are known to be numbered consecutively from 1. The numbers on r of the 
cars are noted: the problem is to find the closest estimate of the number of cars in the’town. 
Following is the solution, on Pitman’s lines, of Schrédinger’s problem. 

Let n, the unknown total number, be assumed to be so large* that variation is continuous, 
i.e. that any car number observed at random has a rectangular frequency distribution. 
The highest of the r numbers observed, namely w, is a sufficient statistic for n because, 
when w is known, the remaining (r—1) variates have a joint frequency distribution 
independent of n; hence all relevant information can be derived from w; the remaining 
(r—1) observations may be ignored. The cumulative frequency distribution of w is 


w"/n*. 
By Pitman’s theory the closest estimate will be that for which the observed w has the 
median value, i.e. if 2 is the estimate of n, 
(w/a) =} or A=2w. 
It is clear that 2" w has median value n. If r = 1, & = 2w, and if r tends towards 00, # tends 


towards w, both of which are reasonable. That # actually is the closest estimate transpir 


* It seems likely that the solution is valid for all values of n. 
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from a theorem of Pitman (1937) to the effect that if # has median value n, and if n’ be 
any other estimating function, then 2 is a closer estimate of n than 7’, i.e. 
Prob {|JA—n| <|n’—n|}>}, 
provided that a variable z can be found, always of the same sign, so that 
nr and 2(n’—fh) 

are independent. 

In the present application suppose that »’ is another estimate of n. From considerations 
of scale, x’ must be homogeneous and of degree 1 in the observation w and the other 


observations w,, Ws, ..., W,;. Hence n’/A must be homogeneous and of degree 0 in w, w,, 
Wy, «-., W,_y, and therefore expressible as a function of 
q; = W;/¥, = he, 2.5 F 4. 


Now it is obvious that the g;, and hence n’/%, are independent of w. Therefore taking z as 
1/2, which is always positive, we see that % is the closest estimate. 

To find an upper limit (in probability) of n, we express the fact that m should not be so 
great as to render too unlikely the occurrence of the largest number actually observed w. 
Accordingly, pre-determine a probability « and set 

(w nn)" >a. 


Hence n<w/atr, 


Example. The number of 30 motor cars are noted and the largest number is found to be 
247: to estimate the number of cars in the town and the upper limit of error of the estimate. 
Here 7 = 30, w = 247, so that 

n = 2130 x 247 = 253 approximately, 
and n < 247/(0-05)"8° = 273 approximately, if a = 0-05. 
The latter statement means that the number of cars in the town will be less than 273 unless 
in taking the particular sample an event, the probability of which was 1/20, occurred. 


SUMMARY AND CONCLUSION 

The criterion of efficiency as determined by a comparison of the variances of two estimates 
of an unknown statistic is identical with the criterion of closeness when the joint distribution 
of the estimates is normal and the criteria will not yield significantly different results in 
practice when the estimates are estimated from large samples and are consistent. Study of 
some particular examples suggests that, even when the distribution of the estimates is very 
different from normal the value of the probability associated with the criterion of closeness 
may not be very different from what its value would be if normality of the joint distribution 
were assumed. An application of Pitman’s theory of closeness is discussed. 


The author is much indebted to Prof. E. 8. Pearson for his helpful criticism of this Note, 
and to Mr N. L. Johnson for pointing out an error in algebra in the first draft submitted. 
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THE RELATION BETWEEN MEASURES OF CORRELATION IN 
THE UNIVERSE OF SAMPLE PERMUTATIONS 


By H. E. DANIELS, Wool Industries Research Association 


1. IyTRODUCTION 


Recent papers by Hotelling & Pabst (1936), Pitman (1937), Kendall (1938) and Kendall, 
Kendall & Babington-Smith (1939) discuss the distribution of the correlation coefficient 
when members of the sample corresponding to the two variates are permuted randomly 
relative to one another. In the case of rank correlation, the characteristics of the population 
sampled are generally unknown, and a significance test has to be based on the distribution 
obtained from the sample in this way. 
Hotelling & Pabst prove that as the sample size is increased, Spearman’s p tends to follow 
a normal distribution law. Kendall’s measure of rank correlation, 7, in which all possible 
corresponding pairs in two given rankings are assigned marks + 1 according to whether they 
agree or differ in order, follows a specially simple distribution law which tends rapidly to the 
normal form and becomes highly correlated with Spearman’s p for samples of moderate size. 
The present paper discusses the properties of the class of correlation coefficients [obtained 
on replacing Kendall’s marks +1 by a more general system of scores. By an empirical 
argument Kendall et al. showed it to be likely that the correlation between 7 and p is 
2(n + 1) z : Le: 5 i Sere en 
[2n(Qn + 5)] for all values of the sample size n, and surmised that their joint distribution 


\ 
tends to the bivariate normal form for large n. These results are, in fact, special cases of the 
relations demonstrated below between two correlation coefficients I’ with different systems 
of scores. 


2. DEFINITION 


Consider the two sets of m sample values 


Ba, Bq, i<3 Bags Sis Bes °=s Sw 

both arranged in some given order relative to each other. They may be permuted to give 
n! different ways of grouping the z’s with the y’s. Let us assign to each pair (x;,x;) what for 
convenience will be termed a score a;; and to each (y;, y;) a score b;;, where 


Ly ss -*. 
Denote by I’ the number P= Sa 
: {(Xa?; Xb?) 
the summation extending over all i and j from 1 to n. Special cases of J’ are Kendall's 7, 
the product-moment correlation coefficient 7 and Spearman’s p, for 7 is obtained by definition 
7 -_ 47% is cive The = 7:-—2: + = Y:—Y; 7 vir -] 2 identity 
when a,;, b;; = +1, j2%, ris given when a,; = x;—2,, b;; = y;—y; by virtue of the identity 


$= (ej — 2%) (Yj—Y) =X UVxyi—UUAY; 
i i ij 
and p is similarly obtained when a,;, b;; = j —1. 
When the 2’s are permuted relative to the y’s, the scores reappear in a new order with 


the same or opposite sign and the denominator of J’ remains unaltered, so that in discussing 
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the distribution of I’ over all permutations it is sufficient to consider the numerator only, 
which we denote by c. 


Write A, B for the matrices of the scores; for example, with n = 4, 


A= 0 Ae Ag Ag]; B= 0 By yg Og 
—a, 9 Ge, Ang —b, 0 bes O24 
—Ay3 —G93 DO Agy —b,, —be, 9  dyy 
—Ayy —Ay —Ay O —by, —byy —by, 0 


With the z’s and y’s in the order as written, c is the trace of the matrix product AB’ 
(i.e. the sum of the elements of its leading diagonal), where B’ is the transpose of B. The effect 
of a permutation of the 2’s, say, is to alter the score matrix of the x pairs to PAP’ and the 
value of c to the trace of PAP’ B’, where P = (p;,;) is the appropriate ‘permutation matrix’ 
obtained by permuting the columns of the unit matrix. For example, corresponding to the 


grouping gq, Ly, By, Tg, Yrs Yo Ys» Ya» 
the permutation matrix is re ee 


In terms of the matrix elements c is given by 


C = LpEPy yb 5p, 
all suffixes being summed from 1 to n. 


3. MomeENtTs 


The distribution of c over all permutations, or the joint distribution of two c’s with different 
systems of scores, is most readily discussed from its moments. The product moment of 
c(1) and c(2) with scores af, 6 and a, b® respectively is 

—— a 

e(1)e(2) = — S(¢(1) (2), 


where C(1) c(2) = 2 pj, Pi Per Pur OAD OM, 
and S denotes summation over all ! possible permutations. Consider the effect of S.on 
each term. The non-vanishing contributions occur when all four p’s are’1, and it is first noted 
that if any of the suffixes 7, 1, r, t are equal, the corresponding suffixes in j, k, s, w must also 
be equal for the term not to vanish, since each row of P contains only one non-vanishing 
element. Terms in which i = | or r = ¢ are of course zero by definition. When, for example, 
i = r so that r is replaced by ¢ in the expression, we shall call i a tied suffiz. Other suffixes 
will be referred to as free suffixes. 

As regards their contribution to S the terms may be classified according to the number 
of tied suffixes in i, 1, r, t as follows. 

(i) No tied suffixes. When the four p’s are each unity, four rows and columns of P are 
assigned and there are (n — 4)! ways of filling the remaining positions. Such terms therefore 


contribute (n — 4)! S’aWay T'oMo® 


to the total sum, where 2’ denotes summation over all values of the suffixes which are not 
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equal. Let us consider some properties of 2’a,a,, and La,,a,, and similar expressions with 
tied suffixes, the second expression being summed over all values of the suffixes. (Super- 
scripts (1) and (2) are understood throughout.) 


(1) 2a,,a,, = 0, 2’a,,a,, = 0, and so on. 
(2) Layay = L’ayay. 
(3) 2agay = L’ayayt L’agay. 


(4) 2a,a,,= 0, 2’a,a,,= 0. The first is true because La,;,= 0, and the second follows 
from the fact that 


20 yy = LAA t+ LAA + ZOO; + 2A Ay + Za y4,,+ L'aya;,+ Z'aya,;, 


the terms on the right after the first cancelling in pairs. 
The contribution to S of terms with no tied suffixes is therefore zero. 


(ii) One tied suffix. For the term not to vanish it is necessary to assign three rows and 
columns of P, and the contribution to S from such terms is 


4(n — 3)! S’aPaP T'bQbe, 
the factor 4 arising from the fact that the same contribution is obtained by tying the suffixes 
in the four possible ways. 
(iii) T'wo tied suffixes. The contribution to S is similarly found to be 
* 2(n— 2)! L’aPaH T'bDb®. 


Terms containing more than two tied suffixes give zero contributions to S, and finally, 
substituting for 2” the appropriate X expressions, we find 

e(i)¢(2) = ain —1)@—3) (Lafpal) — LatPatp) (LooF! — LOW) + = ee | 
The moments of higher order can be obtairied by a similar procedure, but the expressions 
rapidly become unwieldy. 


@) (2) SADA) 
Lay) ay? ZOSpb%. 


4. THE CORRELATION BETWEEN KENDALL’S T AND SPEARMAN’S Pp 


As a first application of the formula we consider the correlation between 7 and p over all 
permutations of the sample values. The scores for 7 and p respectively are 

af), 6b =+1,0 when j2i,j=%, 

a®, b® =j-i. 


The following results are easily derived 


n n 

LaP =n+1-2i, Ya} = jn(n+1 -2%), 

l=1 I=1 
nn n n?(n? — ) ni mn n(n? — 1) 
yy Lapap=-———, & Vaal =. 
i=1 l=1 t=1 i=1 l=1 


and the same results hold for the 6’s. Substitution in the formula then gives 


(1) (2) = n*(n— et 1)? 
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: ee ed) ee 
Again, = = he ay = > x zai aj; = n(n—1) 
dun En] Em i=1 l= 
n n n n> n?2 — l n n n n2 al | 
and SE Sepa ="), FS Sapap =), 
i=l l=1 t=1 12 i=1 I=1 


from which it is found that 








2n(n — 1) (2n+5) 
ee sane: 





_ n4(n—1)(n+1)? 
wie 


e(1)? = e(2)2 
The required correlation is therefore 

Ra 2+) 

pr I[2n(2n + 5)] 








which is the result anticipated by Kendall e¢ al. It should be noted that they use the quan- 
tities Y = $c(1) and . 
ste say — t= D)_ 2) 
6 n 


in place of c(1) and ¢(2). 


5. TRANSFORMATION OF THE SAMPLE VALUES 


If the scales of the x’s and y’s are distorted by a transformation and the product moment 
correlation coefficient r is recalculated on the transformed sample, a new value r’ is obtained. 
In particular, the x’s and y’s may be readjusted to be at equidistant intervals, and then the 
new value of r is Spearman’s p. The formula for c(1)c(2) can be used to find the correlation 
over all sample permutations between the values of r on the same. sample before and after 


such a transformation. Distinguishing by primes the sample values after transformation, 


the scores are 
oF x ae ee 


SS a Oe oes’ 
ay = Xj;— 2, OF = y3— YH. 


n n n n n 
Then Tap =ne—-z), EE LalaP =n? ¥ (e,—z)(x\-7'), 
= = = = = 


n 


n n mn 
( = —\- 
~ Zawar - a Ae —%;) (%,— 24) = 2n2(x,;—Z) (x; -Z’) 
i=1 l= i=1 l= 


by the identity previously quoted. Using these and similar formulae we find 


4n2 








(1) ¢(2) = —— 2e—2) (x' - 2’) Z(y—9) (y'-9’), 
oes He 4n2 ne 2 
of = Sez) 2-H CB = De’ ZPD’), 


and hence the correlation between r and 7’ is ’ 


Rey = Var yy> 


where r,,, and r,,, are the correlation coefficients between old and new values of x and y 
respectively. 
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6. TENDENCY TO NORMAL FORM FOR LARGE n 


It will now be shown for a large class of score systems a;; that c, and hence J’, tends with 
increasing n to be normally distributed, and moreover, that the joint distribution of any 
pair of such I”’s tends to the bivariate normal form. 
The pth order product moments of the joint distribution of c(1) and c(2) are sums of terms 
containing LA gn Vij Vy --- 2b, Op, Dow +++ 
or similar expressions in which arbitrary groups of suffixes within the 2’’s are tied, each 
2” involving products of p scores which may belong either to systems (1) or (2). Every 
such 2” is in turn a linear combination of the corresponding 2 having the same suffixes and 
other 2’s in which additional tied suffixes are introduced. No X may contain a pair of free 
suffixes attached to one score, for it would then vanish by virtue of the fact that 2a,; = 0. 
The even order product moments are first discussed. Let p = 2m. Consider a 2 in which 


the 2m scores are divided into m pairs each having one tied suffix, so that there are in all 


3m independent suffixes, e.g. 
P » ©.§ 25 5D 5p Upp Ayg Ayy, Uy --+ 


It may be written as (LaWaW) (LaWa)- (LaPaQ)”, 
where A+4+v = mand A, p, v are the number of times the scores are paired in the com- 
binations indicated. 

As is always possible, suppose the numerically largest value of a;; to be made equal to 
unity. We now impose the condition that 2a;,,a,, is of order n* whether a;; and a; belong to 
the same or different systems of scores. This is satisfied, when maxa,; = 1, by 7 and p, 
and also by r provided the sample is not an unusual one. With this condition, it is seen that 
~’s of the above type are of order n*”. 

It is next observed that all other ways of tying suffixes give 2’s of lower order of magnitude. 
For the order of magnitude of the bracket is not reduced on replacing each a;; by + 1; con- 
sequently if further suffixes are tied the order of 2 is made less than n*” since there are fewer 
than 3m summations from 1 to n. It follows that the dominant term in a 2” is the corre- 
sponding 2 having the same array of suffixes. 

Moreover, every non-vanishing X involving 3m independent suffixes can only be a per- 
mutation of the type illustrated, while those with more than 3m different suffixes must all 
vanish. This is made clear by considering how the 3m suffixes can be arrayed between the 
2m scores. Begin by assigning 3m different suffixes at random among the 4m available 
places. At least m scores will receive their full complement of suffixes all which will be 
different. There cannot be more than m such completed scores, for if 2 is not to vanish, 
at least one suffix of each complete pair must be tied and this can only be done by repeating 
one suffix from every complete pair in each of the remaining places to be filled, of which 
there are only m. We are thus led to a permutation of the type of X discussed above. If 
there had been more than 3m different suffixes to begin with, there would not have remained 
sufficient empty places to prevent the existence of at least one score with a pair of free 
suffixes, and so all 2’s with more than 3m different suffixes must vanish. 

Any 2mth product moment is the sum of terms like 

der AZO jy... Z'bygd 


tu s**> 


where f is the number of independent suffixes in the 2”’s and A is a coefficient which is of 
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unit order as far as n is concerned. From the preceding argument, the maximum value of 
f is 3m, in which case the term is of order n-3™ x n3™ x n3™ = n3™, When f < 3m — 1 the order 
of the term is not greater than n~3"+1 x n3™—1 x 73m-1 = n3"—1 and such terms may therefore 
be neglected. Write 


hy, = ZaPaQ ToVoM, hy, = LaPaYToVb2, he, = LaPal LoDo. 
Then if terms of lower order of magnitude are neglected, the even product moment 
rs = e(1)" e(2)°, r+s = 2m 
is given by the sum of terms like 
am mA, phy rrighs; 2At+p=r, wt+2Ww=s, 


over all possible values of A, , v. The coefficient A aye which is the number of ways in which 
hi hh, hoe can arise, is calculated as follows. Consider a X whose array of suffixes is such that 
it can be factorized as (LaMalP) (LaPaQy (LaPa}). Its suffix pairs can be permuted 
within the sets of scores (1) and (2) in r! s! ways, but of these A!(2!)4 w! v!(2!)’ give essentially 
the same 2. The suffixes within pairs attached to each score may also be rearranged in 
22" ways without affecting the result, and so 


r!g!Qam rls! gmt 
Army = Alplvi2y” “Alplot* 





The calculation of the even order product moment y,, for large n is in fact tantamount to 
selecting the coefficient of #’t*/r!s! in 
—— i (hyy ti + Qhypty ty + hygtZ)™. 

Finally, we dispose of the odd moments. In certain cases, such as for example the joint 
distribution of 7 and p, they all vanish by symmetry. But even in the general case it can be 
shown that the odd moments are negligible to the order of magnitude n-+. 

A 2 containing 2n+1 scores cannot have more than 3m+1 different suffixes. For if 
there were 3m + 2, let them first be assigned to the 4m + 2 available places; at least m+ 1 
scores will receive complete pairs of suffixes, and the remaining m empty places cannot be 
filled in any way which avoids one score having a free pair of suffixes. Hence as before 
the order of magnitude of any (2m + 1)th moment is at most n—@™+D x m3mt1 x n3mt1 = y3mt1, 

The 2mth moments were shown to be of order n*™, consequently if we define 


y(1) = n-*e(1), (2) = n-*c(2), 
the joint distribution of (1) and (2) has all its even moments of unit order, and by the 
result just proved all its odd moments are of order n-* and may therefore be neglected to 


that order. Reverting to c(1), c(2), it is seen that the moment-generating function of their 
joint distribution tends in the limit to the form 


exp— ni 2 hull + 2hyoty te + gat). 


Hence c(1) and c(2) tend to be normally distributed with variances pruate tall and 
correlation . 
Iyp Latpay? Doh bf 
Mbyte) (ZafpaG RasPaky) /(LOPOY TOO) ° 
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The I's similarly tend to a bivariate normal distribution wich the same correlation, but 
with variances, 4 Za a) Sow bm 4 La?) a?) ZbPb2 


n§ SaWat) Xb? b®? 03 SaPa® F252" 

Our proof rests on the assumption that 2a,,a,, and 2b,,b,, are of order n*, where the 
individual a;;’s and 6,,’s may belong to either score system. But if that is true, it follows 
that expressions like La?; must be of order n?, for they cannot be made to exceed that order 
on replacing a;; by +1, and their order cannot be less than n? since 


1 = 
2ai;— aii Mik = 2(a,;—4,)? 20, 


where a; = 
1 


n 
Ya,;. Consequently the variances of the I”’s decrease like n-'. The correlation 
jf 
I’s tends, however, to a value independent of n in the limit. 


oO Si — 


between th 


SUMMARY 


The properties of a general class of correlation coefficients I’, which includes the product- 
moment correlation coefficient +, Spearman’s p and Kendall’s 7, are discussed. A direct 
proof is given of the formula tentatively suggested by Kendall for the correlation between 
p and 7 when the sample is permuted in all possible ways. The effect of a transformation of 
the sample values is also considered. It is shown that under certain general conditions, the 
joint distribution of two different ‘J”s, calculated on all possible permutations of the sample 
values, tends with increasing sample size to the bivariate normal form with variances in- 
versely proportional to the sample size and correlation independent of it. 
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1. INTRODUCTION 


Pioneer work in trapping field mice alive has been done in this country by Charles Elton, | 


his colleagues and Dennis Chitty. Reference to similar work in other countries, especially 
in the United States, will be found in their papers and in the useful lists of recent 
literature in the Annual Reports of the Bureau of Animal Population, Oxford University. 
We record with gratitude the benefit we have derived from their labours and devices. 

Our own interest lay primarily in comparing, by simple statistical methods, populations 
from species of mammals distributed more or less continuously over great distances. These 
comparisons were to be based on skeletal measurements and proportions. We were limited 
to our own country and to a ubiquitous mammal easily caught in large numbers; thus field 
mice seemed to be the only choice, although for skeletal measurements these small rodents 
present great difficulties. Our Oxford advisers recommended the long-tailed field or wood 
mouse, Apodemus sylvaticus syivaticus, as more or less ubiquitous and easy to catch. We 
decided to follow in their footsteps, and start by trapping these mice alive, marking them, 
setting them free and trapping again, in the hope of getting some first-hand experience of 
how far they wander, and whether they are confined to particular types of country or can 
be regarded as a single population continuous throughout England. 

Even if a whole species can be regarded as a continuous population it is clear that there 
can be no such thing as ‘random mating’ within it, if the individuals rarely wander more 
than a few hundred yards. There must be regional inbreeding, though there may be no 
boundaries between the regions. Can any regional differences be detected between field 
mice if measurable characters are compared by statistical methods? 

The trapping of Apodemus alive is a fascinating pursuit. Many more questions arise from 
it than can be answered perhaps in a lifetime, and it has proved hard to hold firmly to the 
pursuit of our original problem. In trapping over a number of seasons it has been easy to 
collect a vast quantity of facts, and these, when arranged and digested, cry out for the 
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collection of a vast quantity more. It has seemed advisable, however, to try to publish 
some of them at this stage, in the hope that they may prove of interest or use to other 
workers, if we ourselves are not able to follow them up much further. 

The data may be arranged, though with much overlapping, under the four headings: 
Growth, Variation, Wandering, Survival. We hope to publish some selected facts appro- 
priate to these headings in a short series of papers of which this is the first. 


2. TRAPPING TECHNIQUE 


That many of our present records are not suitable for statistical treatment is due to our 
ignorance when first trapping of the best method of laying out our traps. Only now, after 
five seasons’ trapping following on the work of Elton and of Chitty, are we beginning to 
understand the conditions under which large numbers of Apodemus can best be caught. 

There are four main limiting factors: 

(i) Man hours. Elton et al. (1931, p. 661) stressed the number of ‘man hours’ needed for 
setting, visiting, and resetting large numbers of traps, and for dealing with the mice that 
are found in them. The largest number of traps we ourselves have ever set out at a time was 
96, and 48 mice was our largest night’s catch, of which 4 were bank voles and 44 A podemus. 
Our method was to pick up all the occupied traps, replace them with spare traps, and take 
them home. There we identified or marked the mice, weighed and measured them, placed 
them in cages, and wrote up our day’s records. Then we rebaited the traps and renewed 
the food in the attached nest boxes (see p. 145), so that they were ready for taking out to 
replace others next day. Our biggest catches meant a long, intensely hard day’s work for 
three practised people; had there been more of us we could have put out more traps 
covering a wider area and caught more mice, and so have had more adequate data for 
statistical treatment. 

(ii) Weather. Setting out a large number of traps does not necessarily mean catching a 
large number of mice. On some nights we have had all our traps out in likely places without 
a single catch. Weather conditions undoubtedly play an all important part in keeping mice 
from wandering freely, and this in itself needs a carefully planned investigation. We have 
learnt from experience that a night of blustering south-west wind is a good trapping night 
and that, if snow is on the ground, it is of little use leaving out the traps when once the mice 
living in their immediate neighbourhood have been caught; but we are not sure of the 
relative effect on the numbers caught of wind, moon, temperature, rain or hoar frost. 

(iii) Arrangement of traps. Only a certain number of mice will be within reach of any one 
trap under any particular set of weather conditions, and a great number of traps in a dense 
‘scatter’ will catch no more mice under those conditions than few traps more widely spread. 
A line of traps will drain a wider area than an equal number arranged in a grid, while a 
hollow square may catch all the mice within it just as easily as a grid covering the same area. 
With experience some idea may be gained of a good arrangement, but this must still 
depend to a large extent on a further limiting factor: 

(iv) Habitat. Not all places are equally frequented by Apodemus, yet this cannot easily 
be attributed to obvious ecological differences. We have caught mice in large numbers in 
very varied habitats, whereas a comparatively uniform plant community may have good 
and bad patches. We are inclined to suspect that any kind of overhead cover, such as dense 
bracken, heather or thorny scrub, is of as great importance as food or soil factors, but much 
more work is needed to discover where the mice live as apart from where they wander, and 
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whether or no they live in large communities in the winter and separate before the breeding 
season. Light might be thrown on food preferences by analyses of stomach contents and 
by experimental feeding. 

The little information we have on all these matters, together with further notes on our 


methods of trapping and marking, we hope to publish in our papers on survival and 
wandering. 


3. MEASUREMENTS 


For our future comparison of populations from different localities we needed the skeletons 
of fully grown mice. In order to find out at what time of year a mouse population contains 
the largest proportion of such mice, we studied the growth changes of our living population, 
weighing and measuring each mouse, and recording the numbers of mice of each size in 
each month ; we were also able to keep individual growth records of marked mice. The data 
thus collected are analysed in the last three sections of this paper. 

We had often to handle a large number of live mice in a very little time, so we limited 
ourselves in their case to two comparatively easy measurements: (i) the weight, (ii) the 
length of the right hind foot. For any mouse found dead in a trap, or that died in captivity, 
we made further records of: (iii) the total length of head and trunk, (iv) the length of tail, 
(v) the weight of stomach, (vi) the weight of the reproductive system in the male, or of 
visible embryos in any pregnant female. 

We have made these same measurements on all mice trapped for skeletons in other 
localities, and have so gained a first rough index to regional differences. So far we have 
carried out this regional trapping near Westerham in Kent, Hampden in Bucks, Swanage 
in Dorset, and St Mawes in Cornwall. We trapped in each of the years 1937, 1938 and 1939, 
at the very end of March or beginning of April, a time when winter males are past their 
period of maximum weight increase (p. 155), and when young mice are in most years soon 
likely to appear in the traps. These dead mice together form the adult population from 
which the data of Table 1 (p. 139) were calculated. In Fig. 1 their measurements are added 
to those on dead mice caught at Holwood Park and Downe in Kent, and at Cobbler’s Hill 
in Buckinghamshire, at other times of the year. In a future paper we hope to give an 
analysis of the local variation shown by this material, together with such further informa- 
tion on growth as can be gained from correlating the external and skeletal measurements 
of the series of dead mice of all ages from Holwood. 

To facilitate statistical analysis the records of each mouse, dead or alive, were entered 
straightway on to separate index cards, the reference number on each card being that of 
one individual mouse. 

Live mice. It was easy to weigh the live mice. We let them drop from the trap into a large 
net bag, and from there transferred them by hand to a small, narrow bag in which they 
could be tied firmly and placed on the balance. The feet were not quite so easy to measure 
accurately, but we coaxed the mice by gentle pressure from the small bag into a cupped 
hand, one of us holding them there with the right-hind leg projecting between forefinger 
and thumb, while another (always the present writer) stretched the hind foot at right angles 
over the holder’s forefinger and measured it from heel to fleshy tip of central digit with a 
pair of screw callipers with Vernier scale. In this technique we attained a degree of accuracy 
represented by a standard error for a single measurement of 0-18 mm.* 


* This is the s.p. for repeat measurements on the same mouse by one observer, based on 10 or more obser- 
vations on each of 50 adult male mice, the average foot-length being 23-0 mm. 
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We sexed each mouse while holding it, and noted whether a female’s vulva was per- 
forate. We also placed it in a glass inspection jar—a barrel-shaped lamp glass with one 
opening sealed and the other wide enough to admit a hand—and noted the amount of 
yellow streak on the hairs of the chest, a very variable feature. 

Post-mortem errors. That favourite measurement of systematists, the combined length of 
head and trunk, is subject to very great observational error. Indeed, a wide range of error 
must be allowed for in all external measurements on dead mice, taken as they usually are 
at varying intervals after death.* Loss of body weight due to drying or decomposition 
takes place slowly, and we found no appreciable change in 24 hr., but the feet tend to dry 
and shrink before this. Length of foot, and length of head and trunk, will both be shorter 
if measured during the period of rigor mortis than either before or after; they vary with the 
stage of rigor and the amount of massage given before measuring, though in extreme rigor 
it is doubtful if the massage has much effect. As far as dead mice are concerned we hope 
to obtain more accurate information about growth from skeletal measurements, since these 
are of a more stable nature and less liable to observational error. 


Table 1. Above : The variation in five measurements on a number (N.) of dead adult or nearly adult 
Apodemus caught in four localities between 28 March and 13 April 1937, 1938 and 1939. 
Mean (M.), standard deviation (s.D.), and coefficient of variation (c.v.). Below: The 
correlation between these measurements ; correlation coefficient (C.C.). 






































Male Female 
Measurement 

N M S.D C.V. N. M S.D C.V. 
Weight (g.) 161 23-4 2-38 10-2 115 18-8 2-20 11-7 
Head and trunk length (mm.) 162 88-6 4-84 5-5 121 84-1 4-89 5:8 
Tail length (mm.) 142 90-3 5:27 58 107 86:3 4-96 5:7 
Right hind foot length (mm.) 162 22-9 0-82 3-6 121 22-3 0-69 3-1 
Weight of reproductive organs (g.) 149 2-0 0-41 20:5 














Male Female 
Correlation 
N C.C. N. C.0. 

Head and trunk length with weight 161 0-59 115 0-53 
Head and trunk length with tail length 142 0-51 107 0-46 
Head and trunk length with hind foot length 162 0-56 121 0-47 
Tail length with hind foot length 142 0-57 107 0-38 
Weight with weight of reproductive organs 149 0-66 























Weight as a measure of growth. The easiest measurement of a live mouse, its weight, is 
unfortunately not a very satisfactory measure of growth. Apart from seasonal changes, it 
alters considerably from one day, or even hour, to another. This must be partly due to the 
condition of the stomach and bladder, but we suspect that a rapid fall in actual body weight 


* Sumner (1927) has shown how great this error may be, by comparing measurements made on the American 
deer-mouse Peromyscus, (a) by different observers, (b) by the same observer at different times. 
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can also occur after a period of prolonged activity. Frequent trapping, and trapping 
during the breeding season, may also interfere with weight increase. 

Some idea of the extent to which the weight of a mouse is correlated with its length, 
and therefore indirectly with its growth, is given by the scatter diagram of Fig. 1. We 
were not able to measure the length of our live mice, but this diagram is based on the 
measurements of the dead male mice referred to on p. 138. It must be examined with 
reserve, however, as the number of young is few, and the mice were caught at varying times 
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Fig. 1. Correlation between weight and length of head and trunk in 367 dead male Apodemus 
caught in various localities at various times of year. 


of year in several widely separated localities, the young in different localities from most of 
the adults; and, as we hope to show in this and a later paper, weight varies with season 
and size with locality independently of the age of a mouse. In the mixed batch of adulf or 
nearly adult mice from Westerham, Hampden, Swanage and St Mawes (included in the 
upper part of the diagram), we have found the coefficient of correlation between these two 
measurements to be r= 0-59 for 161 males and r=0-53 for 115 females (Table 1). To cal- 
culate comparable coefficients for growing mice would call for far larger numbers in the 
lower weight groups, and for this reason, and because of the mixed nature of the material, 
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we have thought it inadvisable to fit a curve to the diagram.* Any such curve would tend 
to flatten as growth ceases, increase of length with increase in weight becoming solely due 
to the size variability of the fully grown population. For apart from all the sources of 
variation mentioned, weight and length must vary not only with age but also with in- 
dividual character. We have been unable to separate these two sources in growing mice, 
and without more frequent individual records (and these would probably interfere with 
growth) it does not seem possible to be certain that any one live mouse is fully grown. 
Records taken throughout the year suggest that, in the single locality of Holwood Park, 
20-0 g. in a male and 17-0 g. in a female can be taken as arbitrary but useful minima below 
which most of the mice may be expected to be still growing, except in midwinter when 
heavier adults often fall to lower levels than this (p. 154). It will be seen from Fig. 1 that 
males of over 20-0 g. range from under 80 to 100 mm. in length, while those of less than 
20-0 g. can range as high as 90 mm. 

A measure of the variability of weight in adult mice from scattered localities is given by 
the coefficients of variation (100 x standard deviation/mean) which we obtained from the 
mixed batch mentioned above. These coefficients are given in Table 1, where they are seen 
to be nearly double the coefficients for length of head and trunk, although the variability 
in this case includes much greater observational errors. Ruger (1933) found coefficients 
for weight as high as 15-93 and 14-77 in his human maierial (corrected to the standard 
ages of 40-5 years for men and 32-5 years for women), while his coefficients for stature, 
sitting height, and span are less than ours for head and trunk length, and for tail length. 

Summary of factors influencing weight. 

(1) Age. 
(2) Inherent} size of mouse. 
(3) Locality. 
(4) Season: (i) fat deposit; (ii) condition of reproductive organs and, in females, 
presence of embrygs. 
(5) Prolonged activity. 
(6) Trapping interference. 
(7) Content of stomach and intestines. 
(8) Content of bladder. 
(9) Disease (see footnote to p. 158). 

Tail length. The length of the tail from anus to tip, excluding the terminal pencil of 
hairs, is another conventional record of systematists. It is more easily taken on a dead 
animal than is the length of the head and trunk, and is less subject to observational errors, 
but as the tail is liable to shortening through injury, our number of records is not so great. 
The average length of tail in adult mice is very similar to the average length of head and 
trunk, and this is true for mice of all ages found in the traps, indicating that the tail 
continues to lengthen as long as the body is lengthening. On the other hand, each varies 


* A clear analysis of the difficulties of fitting curves to comparable but much more numerous human measure- 
ments has been given by Ruger & Stoessiger (1927) and Ruger (1933). From measurements of fifteen characters 
taken in 1884 by Francis Galton on over 7000 men, women and children, they computed means, standard 
deviations, correlation coefficients, correlation ratios and regression equations, and constructed graphs of the 
regression lines. These papers, and the analytical methods of the biometric school in general, are well worth the 
consideration of those biologists who attempt to base systematic classification, and even evolutionary theory 
on so-called allometric growth curves. 

t Inherent. Def. 2 in Shorter O.2.D.: ‘ Existing in something as a permanent attribute or quality’. 
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to a considerable extent independently of the other (for adult mice, r= 0-51 in males and 
r= 0-46 in females), so that neither can be taken as an index to the other in any particular 
mouse. Fig. 2 is a correlation diagram for the two characters derived from the same series 
of dead male mice that formed the basis of Fig. 1. The same qualifications, due to hetero- 
geneity of the data, apply as were referred to on p. 140. 
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Length of H+ T in mm. 
Fig. 2. Correlation between length of tail and length of head and trunk in 322 dead male Apodemus 
caught in various localities at. various times of year. 


An exceptionally short tail in any weight group may indicate an early loss of tip not 
perceptible or not noticed on measuring. If Apodemus is caught by the end of the tail, the 
skin easily slips off, if the grip on the bone is not tight, often allowing the mouse to escape; 
the protruding bone soon dries up and breaks off. If the loss is of any length—and more 
than half the tail may be lost in this way—it is easily perceptible, as the tail ends bluntly 
without the terminal pencil of hairs, and sometimes with a short length of exposed bone; 
in such cases no measurement was made. 
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Length of hind foot. From our measurements on live Apodemus we have found that the 
hind foot reaches maximum length long before the mouse stops increasing in weight, so 
that this length, within the error of measurement (see p. 138), early becomes a fixed 
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Weight in g. 
Fig. 3. Young live Apodemus, Holwood Park, 1937-8 and 1938-9. To show that length of hind foot 
rarely increases after a weight of 12-0 g. is reached in males, or 11-5 g. in females. 


characteristic of the mouse, and is especially useful in establishing local differences. Fig. 3 
shows that the hind foot of the Holwood Park mice ceased to grow after the mice had 
reached a weight of 12-0-13-0 g., about half the maximum weight of the males. Where 
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the mouse was caught only once the measurement (marked by a cross) throws no direct 
light on the final value of the foot length, but it combines with the rest of the data to give 
a scatter diagram indicating that foot length was still increasing in growing mice at the 
lower body weights. Where the mouse was caught more than once the measurement is 


' 


either marked by a letter A, indicating adult value, when no subsequent increase was 
found, or by a number, representing the number of millimetres short of adult value. It is } 


clear that in almost every case where repeated measurements were obtained four or more 
weeks later (see p. 149), the hind foot had ceased to grow in length by the time the mouse 
weighed 13-0 g., and very often before that. Thus this measurement gives no indication of 
the age of any mouse weighing more than 13-0 g. 


In an April population (Table 1) the correlation between length of hind foot and length 


of head and trunk (r=0-56 for males and r=0-47 for females) is very similar to that — 


between length of tail and length of head and trunk; that is to say, it is not sufficiently 
great to give a clear indication whether an exceptionally small mouse in such a supposedly 


adult population is likely to be inherently small or merely delayed in growth and therefore 
to be rejected for statistical purposes. 


Table 1 compared with human data. The coefficients of variation obtained by Ruger | 


(1933) have already been referred to (p. 141). He gives correlation coefficients for span and 
weight, stature and weight, sitting height and weight, sitting height and span, ranging from 


r= 0-554 to r=0-598 for men, and from r=0-434 to r=0-595 for women, those for women * 


being quoted from Elderton & Moul (1928); these coefficients are very similar to those 
for the Apodemus material of Table 1 and, as there, are less in every case for females than 


for males. Much greater correlation is found in man between stature and span (r=0-818 | 


for men, r=0-824 for women) and between many skeletal measurements. The material 
from which these coefficients were derived, however, was of too different a character from 
ours to warrant detailed comparison. 


A reference to Sumner’s (1926) data on American deer-mice is also called for here. Besides ’ 


other measurements and colorimetric estimates, he gave the means and standard deviations 
for weight, body length (= head and trunk length), and foot length, in three varieties of the 
Peromyscus polionotus group from Florida and Alabama. These are evidently much smaller 
mice than our Apodemus sylvaticus, and quite differently proportioned. For geographical 


reasons his data are again not directly comparable with those in Table 1, and comparison is _, 


better postponed until we deal with local variation. 

Stomach weight. The part which stomach content plays in the total weight of a mouse 
can be judged from Fig. 4, the data for which came from the series of dead mice trapped 
with break-back traps for their skeletons (p. 138). The stomachs were cut at the pylorus 
and at the lower end of the oesophagus, and the spleen removed. As no material difference 


could be detected between the stomach weights of the two sexes these have been bulked | 


together in the histogram. 
It will be seen that the range of weight in 268 mice was from 0-3 to 4-0 g., with a mean of 


1-2 g. and a standard deviation of 0-79 g., but that stomachs of 0-3-0-8 g. were the com- | 


monest. Weights above 3-0 g. were rare, and the one 4-0 g. stomach, that of a female, was 
so enormously distended that it appeared to fill the whole abdominal cavity. When the 


stomachs of two male and two feme:e mice were scraped clean of contents, one of them 


weighed 0-3 g. and the others approximately 0-25 g., so that the ten stomachs in the 0-3 


column in Fig. 4 may be regarded as empty. For younger mice the upper limit of stomach | 


=, 
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weight will of course be less, varying with the size of the mouse, while the lower limit may 
be as little as 0-1 g., as we have found in the unscraped stomach of a 7-0 g. youngster. 

The prevalence in the snap traps of mice with empty or nearly empty stomachs indicates 
that usually only hungry mice seized the bait. Mice living near the site probably found the 
traps early in their nightly wandering in search of food; they rarely had time to swallow 
the bait, though they may have eaten some of the oat grains scattered in front of it as an 
extra attraction. 

But the mice with whose growth this paper is mainly concerned were caught alive in 
large box traps,* to the end of which was fixed a } lb. cocoa tin with bedding, two peanuts, 
and a large pinch of oats. Together with the bait (another peanut), this was more food 
than they usually ate, so that when weighed these live mice are less likely to have had an 
empty stomach than those caught in snap traps. On the other hand, with excess of food 
available, the stomach content may have depended on the time of day at which they were 
taken from the traps to be weighed, and this varied from 10 a.m. to 7 p.m. according to 
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Fig. 4. Frequency distribution of stomach weight in 268 male and female Apodemus caught in four localities 

between 28 March and 13 April 1937, 1938 and 1939. 0:3 g. is the weight of an empty stomach. 
the size of the previous night’s catch. Elton et al. (1931, p. 716) recorded some observations 
on the times of feeding activity in Apodemus, and on the quantities eaten, but more data 
about this are needed. All we can say here is that a variation of as much as 4-0 g. in the 
weight of a full-grown mouse may be due to stomach content, while any variation of up to 
1-5 g. must so commonly be due to this factor that it should be discounted as an indication 
of change in body weight where only a single mouse is concerned. 

Reproductive organs. The growth of the reproductive organs in the male, and the presence 
of embryos in the uterus of the female, are among.the factors influencing body weight. We 
have dissected out and weighed together the testes, vesiculae seminales, Cowper’s glands 
and penis of all dead male Apodemus. We have also weighed together in their uterine casing 
any embryos large enough to make visible swellings in a dead female. Some of these mice 


* Selfridge traps (Elton ef al., 1931, p. 14). We are indebted to Messrs Selfridge for supplying us with these 
traps at a reduced rate. 
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were found dead in the box traps set in Holwood Park, and some were caught in snap 
traps elsewhere. 

(a) Male. The weight changes of the reproductive organs of a male mouse appear to be 
correlated with the changes in its body weight. We shall presently show that in midwinter 
only very small males gain appreciably in body weight, while larger mice are stationary, 
and autumn adults lose weight (pp. 158-161 and Chart 1,C1, D1, D2). Fig. 5 shows the 
weight of the reproductive organs of over 200 male mice of varying weights in different 
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Fig. 5. Weight of reproductive organs in male Apodemus of different weights at different times 
of year. The mice were caught in several years and several localities. 


months of the year. Out of 112 of these mice caught in the months September to January 
inclusive (the majority in November and December), only fifteen weighed more than 18-5 g., 
the mean weight being 14-5 g. as in the November and December histograms of live mice 
on p. 159. Only six of these fifteen mice of adult or nearly adult size had well-developed 
reproductive organs weighing more than 1-7 g., and only two had organs of intermediate 
weight ; while the remaining seven, and all but two of the mice of under 18-5 g., had organs 
weighing less than 0-8 g., by far the most frequent weights being from 0:1 to 0-3 g. In 
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several mice weighing from 17-0 to 19-0 g. the testes had evidently been larger, but were 
withdrawn into the abdomen and appeared like empty, shrunken sacs; the vesiculae 
seminales and Cowper’s glands were also shrunken. These mice may have already reached 
maturity, but had undergone loss of weight and of reproductive activity at the onset of 
winter, and were in a condition similar to that described by Brambell & Rowlands (1936) 
in bank voles.* 

After January, when our live records show that the majority of male mice gain in body 
weight (pp. 154, 155 and Chart 1, C3), it appears that their reproductive organs also grow 
rapidly in most years, coming into the intermediate group of 0-8—1-7 g. The eleven February 
mice shown in Fig. 5 were all in this stage except one, in which the organs already weighed 
2-0 g. Among other mice in this stage were: 

(i) Asingle December mouse of 20-5 g., whose testes were still unshrunken though small. 

(ii) Two advanced January mice. 

(iii) Several March mice, presumably slow in their development. 

(iv) A group of young summer mice, all under 17-0 g. 

(v) Three large mice caught in April, May, and June, of whose significance we are in 
doubt. 

All but one of the 55 mice with reproductive organs weighing over 1-7 g. were themselves 
over 18-5g., and therefore of adult or nearly adult size; the majority of the March mice 
had reached this stage, the weight of their reproductive organs ranging as high as thet of 
adult mice in later months.t Since the range of body weight of adult spring and summer 
mice is from just over 18-0 to just over 28-0 g., and the range of their reproductive organs 
is from 1-8 to 2-8 g., if there is any correlation between the two weights it appears that in 
spring and summer about 10 % of the body weight may be attributed to the reproductive 
organs. This is very different from what may be expected in midwinter, when these organs 
contribute only 0-1—0-3 9%. We have very little data for any one month in the summer, 
however, and cannot trace the month to month connexion between the two weights. There 
is some slight evidence from our records of live mice that a temporary loss in body weight 
occurs after breeding (p. 155), and it would be of interest to know if this is reflected in the 
reproductive organs. 

For April, a better picture of the range in body weight and in weight of reproductive 
organs and of the correlation between them (r=0-66), is given by the scatter diagram in 
Fig. 6. This represents 149 male mice of which the majority were probably adult; they are 
among those described on p. 138 as caught in snap traps between 28 March and 12 April 
in four different localities in 1937, 1938 and 1939.{ 

(6) Female. Of 121 females caught at the same time as the April males just referred to, 
thirteen had embryos visible in the uteri, indicating that the breeding season had started 
in all three years, though there were actually some local differences in this respect. These 


* We understand that these authors have material for a similar paper on Apodemus; much of our own work 
must be complementary to theirs, and we very much regret that they have not yet been able to publish it. 

+ Nineteen males were caught in February and March 1942, too late to be included in Fig. 5. In this year 
both January and February were exceptionally cold months; the mice caught in February corresponded in 
body weight, and in the development of their reproductive organs, with those caught in January in previous 
years, while those caught in March corresponded with February mice of previous years. 

t Since the four groups of mice from these four localities differ among themselves and from the Holwood 
mice in size (including that early maturing element hind foot length), and therefore in body weight, the rough 


estimate of 20g., given on p. 141 as the weight below which most Holwood males may be expected to be still 
growing, cannot be applied to these data. 
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Fig. 6. Correlation between weight of mouse and weight of its reproductive organs in 149 male Apodemus 


Table 2. Female Apodemus caught between 28 March and 13 April in 1937, 1938 and 1939, 


caught in four localities between 28 March and 13 April 1937, 1938 and 1939. 


in four different localities. Weight distribution of those with and without embryos. 
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females ranged in weight from 14-5 to 24-5 g., with an outlier at 28-3 g. They are shown 
in Table 2, which gives particulars of those with visible embryos; where the letter S is 
recorded in the last column but one the weight for embryos and uterus was negligibly 
small compared with the body weight. 

It is clear from these data that even at the beginning of the breeding season, heaviness 
in a female is not necessarily due to the weight of embryos. In our live trapping in 1939 
we found that there was a period of universal weight increase for females between March 
and April, when mice of all weights showed a wide range of increase (p. 158, and Chart 2, 
C5, C5X, D5, D5X), although the weather was very cold and no young appeared in the 
traps until May : there seemed no connexion between pregnancy and previous weight. Prof. 
Brambell’s material may throw more light on this point. In non-pregnant females the repro- 
ductive system contributes but an insignificant amount to the weight of the mouse. 


4. MONTHLY WEIGHT RECORDS 
(a) General discussion 
Our best records of weight changes in live mice are from the season 1938-9. We trapped 
each month in two neighbouring parts of the Holwood Park woods,* a week in one part 
followed by a week in the other, from the beginning of the third week in November until 
the summer breeding season. 

We have given the weeks of the year arbitrary numbers, so that one year may be the 
more easily compared with another. Week 1 always starts on the first Monday in October. 
This has the disadvantage that there may be as much as six days’ difference between a 
week in one year and the week with corresponding number in another year. 














West of footpath East of footpath 
Month 
No. of week Dates No. of week Dates 

November 7 14th to 17th 8 2lst to 25th 
December 11 12th to 15th 12 19th to 23rdt 
January 15 9th to 12th 16 16th to 19th 
February 19 6th to 9th 20 13th to 16th 
March 23 6th to 11th 24 13th to 18th 
April 29 17th to 22nd 30 24th to 29th 
May 33 15th to 18th — _— 
June — — 38 19th to 22nd 
July 43 26th to 27th “= — 























It will be seen from this list that trapping was repeated every four weeks until after the 
March catch, but that six weeks elapsed between the beginning of the March catch and the 
April catch, five weeks from May to June, and five weeks from June to July. In the first 
four months we usually trapped for four nights only in each area, the first four nights of 
each trapping week; in March and April for six nights in each trapping week, in May and 
June for four nights, in July for only two nights. In each of the last three months we 
trapped in only one part of the woods in order to interfere less with breeding. 

Chitty (1937, p. 43) has shown that mice caught one night and set free next day are 


* We are greatly indebted to the late Lord Stanley and to Lady Stanley for permission to trap on the Holwood 
estate at Keston in Kent, the site of a great Iron Age camp and once the home of William Pitt, who here enclosed 
many acres of common land, tended the woodlands, and planted trees on the bulwarks. 

+ All traps were taken in on the night of 21 December owing to a fall of snow. 
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likely to reappear in the same or a neighbouring trap night after night. This keeps other 
mice out of the traps, or, if surplus traps are set, greatly adds to each day’s labour. We 
therefore placed all the mice caught in cages, a separate cage for each trapping site, and 
released them in their home area only after the traps had been taken up at the end of the 
week’s trapping. An exception was made in the case of pregnant or lactating females, 
which were set free each day; even these generally came back to the trap next night, and 
we are afraid that some interference with breeding must have taken place. At times young 
were born in the traps, or before the mother could be set free, and she then generally killed 
and ate them, or else left them to die. Once when we set a mother free with her young 
where caught, she did not take them back to the nest, but they were found abandoned and 
dead on the same spot next morning. At other times a released female would show a great 
drop in weight when recaught next day, suggesting that her family had been born in the 
interval (p. 153 and Fig. 8, group VI, no. 600); as she may have been away from the nest 
for more than 12 hrs. it is all too likely that the nest young died meanwhile.* 

Even out of the breeding season, during a period of active growth, there seems some 
evidence that frequent catching interferes with weight increase. Mice caught in two con- 
secutive weeks, as happened frequently to those living on the adjoining borders of our 
two areas, often weighed less at the second catching. We have further evidence of this from 
other years when one area was trapped again after a short interval. It is conceivable that 
the commencement of breeding in a whole population might be delayed by too frequent 
trapping and too long periods of captivity. 

Chitty (1937, p. 46) has given a list of eight disturbing effects of trapping, and our method 
of keeping the mice together in cages for séveral days is likely to enhance and add to these. 
On the other hand, it probably enabled us to catch a greater number of mice and thus to 
have better data for statistical treatment; for we have evidence that the local inhabitants 
can be removed from an area in a night or two, and that outsiders then wander into it, if 
suitable weather conditions prevail. This evidence we hope to give in a future paper on 
wandering. 

We caught 343 different mice in the season 1938-9, of which 134 were never recaught, 
including nine which died in the traps or cages. Twenty-three mice were under 10-0. in 
November and December, but only nine of these were ever caught again, and only four 
reappeared each month throughout the season. These four mice gave us our most complete 
individual weight records, which are graphically represented in Figs. 7 and 8, groups I-VI, 
together with those of a number of other mice of greater weight when first caught. In 
these graphs as many mice have been selected as could be represented clearly in a few text- 
figures. Mice of less than 7-0 g. rarely appear in traps, and it is probable that they do not 
wander out from the nest until about this size unless the nest is destroyed or their mother 
fails to return to it; the smallest mice we have ever caught in traps were two males of 
5-5 and 58 g. 

Besides these fairly complete individual records we had many lasting over shorter 
periods. The spot diagrams of month to month weight changes (Charts 1-3) and the 
monthly weight frequency histograms of Figs. 10 and 11, are an attempt to utilize these 
data by massing them, which should to some extent smooth out hour to hour variability 
due to such causes as change in stomach content. 


* Burt (1940, p. 15) has, however, ‘one record of newly born Peromyscus which lived sixty hours without 
parental care of any kind’. 
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(b) Individual weight records. Growth graphs 


Males. Fig. 7, group I, shows the rapid rise in weight of four young males between the 
November and December catches. The rise was greatest in the smallest mouse, but not 
sufficient for it to overtake the others. Between December and January the rise was very 
slight ; in the smallest mouse, no. 468, it was still slightly greater than in nos. 423 and 528, 
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Fig. 7. Individual weight records of live Apodemus caught at monthly intervals in 
Holwood Park, November 1938 to July 1939. 


whereas no. 420 actually lost a little. At the top of group I, nos. 508 and 511, and possibly 
also no, 414, were already of adult size in November; they all showed loss of weight between 
November and December, and the first two continued to drop until January, while no. 511 
remained stationary. All the mice began to put on weight after the January catch, the older 
mice still remaining ahead of the younger mice when the breeding season was reached. 
Taking them all together the period of most rapid growth was between February and 
March; the increase tended to slacken after March or April, and was followed in three of 
them by actual loss in weight. 
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Fig. 7, group II, shows three more mice of adult size losing weight between November 
and January: nos. 489, 417 and 418. No. 487, of only 18-5 g. in November, also lost weight, 
and may also have reached maturity in the autumn breeding season, but if so it was an 
exceptionally small mouse. Nos. 506 and 475, of 14-5 and 13-5 g. in November, were 
evidently older than the young mice in group I and showed a much smaller increase than 
these by December; they did not change between December and January, and again 
showed smaller increases in the spring, both reaching only 20-5 g. by April, whereas the 
younger mice in group I reached from 22-5 to 24-5 g. The older mice in group II, like those 
in group I, kept far ahead of the younger ones, with the exception of no. 487 which was 
overtaken by them. This may be another indication that no. 487 was an exceptionally 
small mouse; it had a very short hind foot, only 22-0 mm., but, as has already been shown, 
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Fig. 8. Individual weight records of live Apodemus caught at monthly intervals in 
Holwood Park, December 1938 to July 1939. 


there is not a very high correlation in adults between length of hind foot and size of mouse. 
That there is also no close association between foot length and rate of spring increase in 
weight can be seen in Fig. 9, where, however, one of the two dots in the bottom left-hand 
corner represents no. 487. 

Fig. 8, group III, contains the weight graphs of seven male mice which were not caught 
until December. A comparison with Fig. 7, group I, suggests that nos. 564 and 549 were 
very young mice in November, as they showed a sharp rise in weight between December and 
January. The larger mice, nos. 553, 568 and 574, remained nearly stationary, and judging 
from other mice not included in the diagrams were likely to have had about the same 
November weight of 16-5-17-0 g., though there were also a few smaller mice in November 
which increased to that weight by December ar’ January (Chart 1, D1, D2). The largest 
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mouse of all, no. 576, lost weight between December and January. They all showed the 
rapid increase which was common to male mice in the spring, and which slackened off 
sometimes at the end of March, sometimes in April. 











Females. Fig. 7, groups IV and V and Fig. 8, group VI, [Tage 
give the weight graphs of a number of female mice. They | of R-HF. 
show that young females soon fell behind young males of bates & 
comparable November weight; no. 433 was an exception, : ee e 
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521, 425 and 459, still showed losses between January ; &% nt ‘ 
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which may have been due to early pregnancy; they were Fig. 9," Live male Apodemus, Holwood 

comparable in amount with those of the males between pa;k, 1939. To show that there is 

these months, and between January and February, but almost no correlation between length 

were not as great as those of the males between February °f hind foot and rate of spring in- 
crease in weight. 

and March. 


Advanced stages of pregnancy may be picked out by a sudden sharp rise in the graph, 
while a long vertical fall following this in the same week may be presumed to indicate the 
birth of young, though we had no other evidence of this when it occurred during release 
from captivity (see p. 150). Two examples may be taken from the graphs: no. 431, group IV 
and no. 600, group VI. No. 431 weighed 24-2 g. on i6 May. She was released near the same 
trap that afternoon, and found again next morning in another trap 10 yards off, together 
with three young which she had killed; her weight on this second day was only 18-6 g. 
She may have given birth to more young and eaten them, but in case there were still 
young unborn she was released again in the afternoon. Next day, 18 May, she was recaught 
in the original trap and then weighed only 17-4 g. No. 600 was caught on four consecutive 
days in June; she had the unusually heavy weight of 31-0 g. on the first day, had lost only 
a gram on the second day, but on the third day had dropped to 24-7 g., a weight which she 


retained on the fourth day. Apart from this drop in weight there was in this case nothing 
to indicate the birth of young. 


Summary of information from individual weight graphs 

Males. (i) Between November and December the smallest males gained most weight, 
larger mice remained stationary, while the largest lost weight. 

(ii) Between December and January some of the smallest mice still gained a little 
weight, larger mice remained stationary or continued to lose weight. 

(iii) Nearly all males put on weight after the January catch. 

(iv) The period of most rapid male increase was between February and March. 


(v) After March or April gains were smaller and some losses occurred. 
Biometrika 33 
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(vi) Most of the older mice were still ahead of most of the younger mice at the beginning 
of the breeding season. 

Females. (vii) Between November and December the weight changes of the females 
were similar to those of the males, except that the smallest females did not gain so much 
weight as the smallest males. 

(viii) From December to March no general rule was apparent. 

(ix) Between March and April increases were universal. 

(x) In this year the breeding season started at the end of April. 


(c) Combined weight records. Spot diagrams of month to month weight changes 

The information just summarized comes from the very small sample of the population 
represented in the individual weight graphs. It is confirmed and amplified by the spot 
diagrams of month to month weight changes, which represent rather larger samples of the 
1938-9 population. There are three series of these diagrams shown on Charts 1-3, and it is 
to these that the letters and numbers in the present section refer; each puts on record 
information not given by the others, and between them they enable most of the data from 
all the mice caught more than once to be utilized. 

A series. Weight in each month plotted against loss or gain by the following month.* 

C series. Weight in November plotted against loss or gain in each succeeding monthly 
interval. 

D series. Weight in November plotted against weight in each succeeding month. 

The CX and DX diagrams are given because we had a very large catch in January and 
the records of a number of new mice start then. The records of those males caught in 
November seem to show that autumn adults could still in January be distinguished by 
weight from autumn young. Judging from the data given in Chart 1, D2, the former might 
be expected not to have fallen to a weight of less than about 18-5 g. and the latter not to have 
increased to this weight. For adult females it is convenient to take a lower January limit 
of 17-5 g., for though some had undoubtedly dropped below this, it was then impossible 
to distinguish them by weight from growing young. 

Adult males’ winter loss of weight and early spring increase. The bulk of the November 
males of 18-5 g. and over lost weight by December, some of them as much as 2:5 g. 
(Chart 1, D1, C1).¢ Between December and January some again lost weight and some 
remained stationary (Chart 1, C2), the bulk of them in January being from 1-5 to 2-5 g. 
behind their November weight (Chart 1, D2). Between January and February some began 
to put on weight again, but some were stationary (Chart 1, C3, C3X) so that while a few 
were now ahead of their November weight, others were still from 1-5 to 2-5 g. behind it 
(Chart 1, D3). Between February and March nearly all showed big gains, most commonly 
of about 2-0 to 3-0 g. (Chart 2, C4, C4X); all now reached their November weight again, 
and some were as much as 3-5 g. ahead of it (Chart 2, D4). Between March and April the 
gains were very irregular and there were two losses, the range of increase being from 

—0-5 to +4-5g. (Chart 2, C5). The bulk of the adult mice were now from 2:5 to 3-5 g. ahead 
of their November weight; the heavier the mouse in November the heavier it usually was 
in April, the range of weight being then from 23-5 to 30-0 g. (Chart 2, D5). 

* Al=C1; diagrams A2, 3 and 4 have been omitted for economy in space. 


+ The weights given in the text are the actual weights of the mice correct to 0-1 g. In the spot diagrams the 
data are grouped into 1-0 or 0-5 g. divisions, and so may sometimes appear not to correspond with the text. 
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Young males’ winter and early spring changes. Of the November male young (under 
17-5 g.), those under 15-5 g. had all gained weight by December, and, on the whole, the 
smaller they were the more they had gained; those under 11-5 g. had advanced very fast 
indeed, some gaining 5-0 to 6-0 g. and overtaking many of those above this weight which 
showed only small increases (Chart 1, D1). Between December and January the two mice 
which were smallest in November had both gained another gram, whereas the slightly larger 
mice had gained less than this or lost a little (Chart 1, C2); for the whole period, November 
to January, it remained true that the smaller the mouse in November the more it was likely 
to have gained (Chart 1, D2). Between January and February the bulk of the November 
young put on weight, some as much as 4-0 g. (Chart 1, C3), but the most rapid period of 
spring growth in this year, for the young as for the adults, was between February and 
March, when there were no losses in weight among the younger males but a wide range of 
increase, gains of 4-0 to 4-5 g. now being very common (Chart C4, C4X). Six weeks later, 
towards the end of April and just before the breeding season, some further big increases 
were shown, but on the whole the rate seems to have slowed down, the commonest gain 
being about 2-0 g. (Chart 2, C5, C5X); it still held good that the smaller a young mouse 
was in November the greater was its expected gain, the smaller young having by now over- 
taken the larger young within the range of 20-5-24-5 g. (Chart 2, D5). 

An overlap in weight by February. Between the November young and the November 
adults an overlap in weight had already started in February, just before the period of 
maximum spring growth; this overlap was then between 18-5 and 20-5 g. (Chart 1, D3, 
D3X), while in March it was between 20-5 and 24-5 g. (Chart 2, D4, D4X), and in April 
between 21-5 and 26-5 g. (Chart 2, D5, D5X). From February on, therefore, there was no 
clear division by weight of the November male adults from the November male young, 
though the bulk of the former were still heavier than the bulk of the latter. 

Male weight losses in the breeding season. After the start of the breeding season our trap- 
ping was confined to one only of our areas in May, to the other in June, and to the first 
again, but for two nights only, in July. This meant that we caught many less mice in each 
month, and among these there were very few of our November catch: in May only three 
of the November male adults and four of the young, in June two of the adults and nine of 
the young, in July one of the adults and one of the young. 

In May and June the bulk of the November young weighed from 20-5 to 24-5 g., their 
average weight in May being 22-7 g., a gram higher than in June. The few remaining autumn 
adults were very scattered in weight, overlapping the young in range but with an average 
still a little ahead of them (Chart 3, D6X, D7X). Between April and May there were 
nearly half as many losses as gains, and although these had apparently no relation to the 
weight of the mouse in November (Chart 3, C6X), the losses were confined to mice which 
in April were 21-5 g. and over, while the laggards then under 21-0 g. all showed gains 
(Chart 3, A6); in all, nine mice lost weight, nineteen gained weight, and four remained 
the same. Between April and June losses outnumbered gains by fifteen to twelve (Chart 3, 
A7). It is perhaps significant that all these losses occurred in the early part of the breeding 
season, during the two months in which pregnant females first appeared in the traps. The 
net result, a male population almost stationary in weight, is best seen in the weight histo- 
grams (Fig. 10). 

A possible summer period of further weight increase. Of the thirty-four males caught in 
May only thirteen were recaught during the two nights when we trapped over the same 
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area in July. This is a very small number from which to draw conclusions, but since eleven 
of the thirteen had gained from 1-0 to 2-5 g., and only two had lost weight, it is possible that 
a new period of weight increase had followed the period of losses (Chart 3, A8). The increase 
in weight of this very small sample of the July population is again best seen in Fig. 10; 
leaving out the new season’s young, the average weight was 24-3 g., whereas the May and 
June averages were only 22-9 and 22-8 g.* It may be noted here that no young were born 
in the traps or cages in July, and that none of the females caught appeared to be in an 
advanced state of pregnancy, the heaviest weighing only 22-3 g. 


Table 3. Adult females’ loss of weight after capture, 1938-9 
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1 422. Ill, kept in for two days; weight increased again to 23-0 g. 
* 415 and 416. Both appeared nearly parturient and were not released. Young subsequently heard in cage, 
but died and were eaten. Mothers weighed again on 24 November and released. 


’ 466. Vaginal discharge on 22 November; not weighed, kept in cage. 24 and 26 November, young heard. 
1 December, six dead young seen, mother weighed 18-6 g. 

* 620. Appeared nearly parturient, kept in. Young seen and heard next day; some killed and partly eaten 
by mother, the rest placed by us in some hay with the mother in the woods where she had been caught. Young, 
all dead, found still in hay on third day; mother recaught weighing 17:6 g. 

5 431. Three dead young in trap. 

6 717. Bleeding at vulva. 


? 600. This very heavy mouse had a very long right hind foot. 
The remaining weight losses all occurred while the mice were set free overnight. 


Adult females in winter. Among the females the November adults were not so clearly 
marked off by weight from the November young as among the males. Teats were prominent 
in November in all female mice of 17-5 g. and over, and also in two out of three of about 
16-5 g. A heavy female rarely dropped in weight to less than 17-5 g. when known or supposed 
to have given birth to a family (p. 150, and Table 3); this is therefore a convenient weight 
above which to regard all females as adult, and ‘November adults’ of 1938-9 will here mean 
those shown on the graphs as 17-5 g. and over in November, though it is probable that 
females of a lower weight than this do become pregnant. 


* The averages given on p. 155 were for the November young only. 
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Late autumn pregnancies. Among these adults the biggest losses between November 
and December were in two mice which fell from 24-8 to 21-5 and to 17-5 g. respectively 
(Chart 1, D1). The weight graph of one of these has been seen in Fig. 7, group IV (no. 521). 
Most of the loss had already occurred when the mice were weighed for a second time in 
November (see Table 3)—the first mouse (no. 521) when recaught after a night’s release 
from captivity, the second (no. 466) at the end of a week’s captivity after giving birth to 
a family of at least six young. Such losses are comparable with those in May when it was 
known or suspected that pregnant females had given birth to young (p. 158). Three females 
of about 20-0 g. in November lost from 1-5 0 2-5 g. by December, while one of 17-5 g. 
remained at this weight (Chart 1, C1, D1); these mice may have been nursing mothers in 
November (p. 161). 

Among females, no conspicuous weight increase until after March. Between December and 
January one mouse in this group gained weight (but not as much as it had previously lost), 
three lost weight, and two remained the same (Chart 1, C2); between January and February 
one gained a little, two lost weight and two remained the same (Chart 1, C3, C3X); between 
February and March, the most rapid period of male growth, three lost weight and two 
remained the same (Chart 2, C4, C4X). The net result was that by March the five 
surviving November adults were all from 2-0 to 5-0 g. behind their November weight, 
in great contrast to the adult males which had all by now overtaken that weight, some 
showing gains of up to 3-5 g. (Chart 2, D4). In the six weeks between the March and April 
trappings a period of growth at last set in for all females, and the five surviving adults 
showed increases of up to 4-0 g. (Chart 2, C5, C5X), all but one of them having now nearly 
caught up with their November weight (Chart 2, D5). After April only one of these adults 
was ever caught again—in June, when its teats were prominent and its weight was 
26-2 g. 

Young females compared with young males in winter and early spring. Among the Novem- 
ber young only three very small females of 8-0, 8-1 and 8-7 g. in November put on much 
weight between November and December, and of these only the largest put on as much as 
the males of similar size (Chart 1, C1, D1). The individual graphs of these three were given 
in Fig. 7, groups IV and V (nos. 431, 450 and 433). Young females of 10-5—13-0 g. in November 
did increase slightly, but not nearly as much as the males of this weight, while from 13-5 
to 17-5 g. there were small changes in both directions. Between December and January 
the very young mice ceased their rapid increase, two of them actually losing weight, while 
those of intermediate size showed some gains and some losses (Chart 1, C2); young females 
behaved very like young males in this month, the juvenile population as a whole remaining 
stationary (Chart 1, C2). Between January and February the very youngest females were 
once more putting on weight, but from 10-5 g. upwards losses and gains about balanced 
one another; this was in contrast to the males, only two of which showed loss in weight, 
while there was a high proportion of big increases (Chart 1,C3,C3X). Between February 
and March the contrast was still greater; there were fewer losses among the young females 
but no very big increases, and four remained the same (Chart 2, C4, C4X). The net result 
was that the mice which were smaller in November had gained from 5-0 to 9-0 g. by 
March while the rest showed intermediate gains and losses in rough accordance with 
their November weight (Chart 2, D4). Already since February some of the November 
young had overtaken some of the November adults within the range of 16-5-18-5 g. 
(Chart 1, D3X and Chart 2, D4X). 
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The six weeks between the commencement of the March and the April trappings were, as 
stated, a period of growth for all females. Young mice as well as old showed gains of from 
0-5 to 4-0 g., about 2-0g. being the commonest (Chart 3, A5 and Chart 2, C5, C5X). These 
gains were much the same as those of the males for this period, since the male increase had 
by now slowed down, but nevertheless the bulk of the females were still far behind males of 
similar age (Chart 2, D5, D5X) and, in spite of further male losses in May and June, never 
overtook them unless in advanced pregnancy (Chart 3, D6X, D7X). 

Spring pregnancies. As already seen in November, advanced pregnancy reveals itself in 
these diagrams by a wide gap between an individual and the bulk of the female population,* 
just as it is disclosed in the individual weight graph by a sudden sharp rise (p. 153, and 
see also Table 3, p. 156). There were already two such individuals in April (Chart 2, C5, 
C5X), mice which had weighed only 15-2 and 16-2 g. in November (Fig. 7, group V, nos. 
497 and 457). In May four other females were widely marked off from the rest by excep- 
tionally big increases of from 8-5 to 11-0g. (Chart 3, A6); the individual graphs of two of 
these are seen in Fig. 7, group IV (nos. 431 and 419). All four had prominent teats, and two 
gave birth to young before it was possible to release them; a third (no. 717), after being 
released twice, was found to be bleeding at the vulva when caught for a third time, and as 
she had dropped in weight from 26-3 to 24-5 to 19-5 g., may be presumed to have given 
birth to a family during her second night of freedom. In June there were two other excep- 
tional increases, of 11-5 and 8-5 g. (Chart 3, A7); after release and recapture the weight of 
these mice was found to have dropped from 31-1 to 24-7 g. (no. 600), and from 26-5 to 20-0 g. 
(no. 660), so that they also may be supposed to have given birth to young in the interval; 
the individual graph of the first of these two is shown in Fig. 8, group VI (no. 600, see also 
p. 153).¢ The next largest increase between April and June was of 5-5 g., but in this mouse 
the drop in weight on recapture was gradual and not easy to interpret; it was from 23-0 
to 21-3 g. after the first night, and from this to about 20-0 g. after the second and third 
nights (Table 3, p. 156, no. 643). 

In June and July prominent teats were noted in all but two of the thirty-one females 
weighing 17-5 g. and over, whereas in May, out of twenty-five such mice, they were noted 
in only four besides those four already mentioned as in advanced pregnancy. 

Apart from these cases of pregnancy, the bulk of the young females showed advances 
of from 1-5 to 3-5 g. between both April and May, and April and June; two remained the 
same in May and two had lost weight (Chart 3, C6X, C7X). The only female which lost 
weight between the April and June catching was one of the two which had put on so much 
weight between March and April as to have suggested an early pregnancy (see above, and 
Fig. 7, group V, no. 497). 

None of the females caught as adults in November, or presumed to have been adults 


* It must not be forgotten that any single exceptional measurement might be due to an exceptionally full 
stomach (p. 144 and Fig. 4) or to an error in recording. Single errors may also creep in in some such way as 
follows. In March 1941 a female mouse of 29-2 g. was recorded on capture as appearing to be in an advanced 
state of pregnancy. She was not released and next day was found dead in the cage, and on dissection proved to 
have a mass of tapeworm cysts attached to the small remaining fragments of the liver, and occupying most of 
the greatly distended abdomen. This mass weighed 13-1 g., which, subtracted from the apparent weight of the 
female, would leave her true weight, without liver, at 16-1 g., that of a not yet fully grown mouse; when first 
caught in December she had the adult weight of 19-5 g. Had she not died she would have remained in the records 
as a unique case of pregnancy in the middle of March. 

t This mouse, no. 600, was apparently an exceptionally large individual. The length of its right hind foot was 

bout 24-0 mm., much above that of the average female. 
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then by their weight in January, were recaught in May or July, and only two* in June 
(Chart 3, D7X). 

(d) Combined weight records. Monthly histograms 
In Figs. 10 and 11 are frequency histograms} showing the monthly weight distribution 
of the Holwood Park 1938-9 male and female populations. Use is here made of all the mice 
caught, even those caught only once. In Fig. 12 these histograms are used as a background 
to a similar series for 1937-8. In that year, instead of an intensive fortnight’s trapping at 
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Fig. 10. Monthly weight frequencies of live male Fig. 11. Monthly weight frequencies of live female 
Apodemus, Holwood Park, 1938-9. Apodemus, Holwood Park, 1938-9. 


4-weekly intervals, each month’s catch was spread over the whole month; we trapped each 
week, covered a wider area, and shifted our trapping sites according to a different plan. 
Individual mice were caught at much less regular intervals and so gave less satisfactory 
records, while the monthly grouping is not strictly comparable with that for 1938-9. The 
general picture of weight increase in the two years is the same, however, and presents in 
another way information already given by the individual weight graphs and the month 


* One, no. 637, was not caught in April, so is not included in Chart 3, C7X. 


+ In these diagrams and in Fig. 12 the unit of frequency on the vertical scale equals the distance between 
the dots on the vertical lines. 
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to month spot diagrams: a late autumn population reducing its wide weight range by the 
growth of the small mice and the weight losses of the large mice; a nearly stationary mid- 
winter population ; and a rapid early spring increase among the males followed by a later, 
lesser increase among the females. Reduced trapping during the breeding season of 1939 
extended the picture for that year, and showed a slowing down of weight increase and the 
reappearance of a wide weight range. Comparison of Figs. 10 and 11 shows once more 
that in 1938-9 the female population kept up with the male until after January; between 
February and March was the period of most rapid male increase, whereas the female 
population did not as a whole gain much weight until after the March catch. 
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Fig. 12. Monthly weight frequencies of live Apodemus, Holwood Park. 
Plain outline, 1938-9. Hatched, 1937-8. 


Where differences between the two years can be detected they can probably be attributed 
to the marked weather differences. The more obvious differences in the histograms are as 
follows: 

(i) More young of both sexes in November 1938-9. 

(ii) More males of over 20-0 g. and females of over 22-0 g. in November 1938-9. 
(iii) A group of smaller mice of both sexes in February 1938-9, not present in 1937-8. 
(iv) A group of females of over 20-0 g. in March 1937-8, not present in 1938-9. 
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With regard to these four points, the following comments may be made: 

(i) The autumn of 1938 was exceptionally mild and breeding continued late. A number 
of 7-0—10-0 g. youngsters were caught in November and some even in December (cf. p. 150). 
All the females of over 17-0 g. and two of 16-5 g. had prominent teats; some were probably 
nursing young in the nest, while others may have been pregnant (cf. pp. 156-7). All these 
females were set free each day but they were often caught three nights running and we are 
afraid that some of the nest young may have died meanwhile (cf. p. 150 and footnote); if so, 
this would have reduced the number of 7-0 to 10-0 g. youngsters in the December population. 

(ii) The population of November 1938 differed from that ot November 1937, not only 
because of the number of very young mice but also because of a greater number of heavy 
adults. The male group of over 20-0 g. is especially conspicuous in the histogram of Fig. 10, 
and in weight distribution resembles on a small scale the total male population in April, 
the beginning of the 1939 breeding season. Most of this group, like the similar group of 
females, may still have been breeding, but more information is needed about the connexion 
between weight, season and fecundity in Apodemus, which we believe has already been 
collected by Brambell and Rowlands (pp. 146-7). In the November 1937 population there 
were far fewer males of over 18-0 g. in proportion to the intermediate group of 13-0-18-0 g. 
The earlier onset of cold weather is likely to have checked breeding, and the consequent 
recession of the male reproductive organs to have been accompanied by a loss in weight. 
The November catch in this year indeed appears comparable with that of December 1938, 
when adult males had lost from anything up to 2-6 g. of their November weight (p. 154 and 
Chart 1, C1, D1). 

As regards adult females, it is true that in November 1937 we caught at least ten which 
still had prominent teats, but though eight of these weighed over 18-0 g. and are part of a 
conspicuous group at the tail of the histogram, there were none comparable with the 1938 
group of over 22-0 g. Like the rarity of youngsters of under 10-0 g., and the evidence from 
the adult males, this again points to the 1937 breeding season having ended before 
November. One unique little mouse must not be forgotten however: a male of only 7-2 g. 
caught on 2 December of that year. 

(iii) The next conspicuous difference in the histograms is in February. While we can be 
fairly certain that the differences in November are connected with the long mild autumn 
of 1938, the reason for this February difference is less clear. Since in December 1938 we 
caught eight females of over 16-0 g., and among these were five new mice still with pro- 
minent teats, it is tempting to think that these or similar late breeders may have been 
responsible for the batch of small mice characteristic of February 1939 and not found in 
February 1938. Individual analysis, however, shows that this conspicuous group is as 
likely to be composed of mice whose spring growth was for some reason delayed, as to be 
the result of autumn breeding prolonged into December (see also Chart 2, C4, C4X). 

(iv) The only other conspicuous difference between the two series of histograms is in 
March, where among the females of 1938 there is a prominent group of over 20-0 g., absent 
in 1939. In March 1938 there were three almost unbroken weeks of warm, sunny weather, 
and towards the end of the month two families were born: one in a cage on 25 March after 
the mother had been captive for four days, and one in a trap on 26 March. On this we 
stopped trapping, hoping that we had not already interfered appreciably with the summer 
population by repeatedly catching and keeping in captivity members of the winter popula- 
tion which might otherwise have started to breed. In 1939, March was a comparatively cold 
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month, and early April very cold, so that it was not until the beginning of May that we 
began to get births in the traps and cages (cf. p. 158). 


5. SUMMARY 

1. Introduction (p. 136). 

2. A discussion of the factors which limit the size of the samples caught when trapping 
Apodemus populations for comparison by statistical methods (p. 137). 

3. A discussion of the value and reliability of certain conventional measurements, 
(a) in determining the probability that an individual mouse is fully grown, (b) in comparing 
fully grown populations from different localities (pp. 139-44). 

The part played by the stomach content (pp. 144-5) and by the reproductive organs 
(pp. 145-9) in the varying weight of a mouse (p. 141). 

4. (a) An analysis of monthly weight records of a living population in a single locality 
(pp. 149-50). These records were made in order to determine the best time of year to trap 
when comparing populations from different localities; but they also give a picture of the 
growth of Apodemus and how it is influenced by season. The picture is constructed from: 

(b) Graphs of the weight records of those individual mice caught most frequently in the 
trapping season 1938-9 (pp. 151-4). 

(c) Spot diagrams in which the weights, or weight changes, of individual mice are 
plotted against their weights in a previous month (pp. 154-8 and Charts 1-3). 

(d) Histograms comparing the weight frequencies of the sample populations caught in 
each month of the trapping season 1938-9; and a comparison of these histograms with a 
similar series for 1937-8 when weather conditions were markedly different (pp. 159-61). 


Our grateful thanks are due to Miss Enid Harris for her dextrous help in field and 
laboratory, to Miss Joyce Townend for her skill and care in making the diagrams, to Prof. 
E. 8S. Pearson for a critical reading of the manuscript which led to many improvements, and 
to Dr G. M. Morant and Mrs Karl Pearson for useful suggestions when reading the proofs. 
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THE CONTROL OF INDUSTRIAL PROCESSES SUBJECT 
TO TRENDS IN QUALITY 


By L. H, C. TIPPETT, British Cotton Industry Research Association 


In many industrial processes there is a progressive change in the quality of the products 
with time, which is tolerated up to a certain stage, after which corrective action is taken. 
An example occurs in the mass production of metal parts by a machining operation to some 
specified dimension. Random fluctuations in the dimension of articles produced at any one 
time are caused by such factors as variations in the materials processed and the manipulation 
of the machine, and superimposed on these is a trend due to tool wear, say in the direction 
of an increase in the dimension. At some stage, it is decided that too many articles are 
exceeding the tolerance limit of the dimension and something is done: we shall say that the 
tool is discarded. 

In order to decide when to discard the tool, one procedure is to take samples at regular 
intervals of time, measure the articles, and calculate the mean dimension for the sample; 
when that mean reaches a certain value the tool is discarded. But the sample means are 
subject to sampling errors so that they do not measure exactly the ‘true’ or population 
mean of all the articles produced at the given time. Consequently, if tools are discarded at 
one level of sample means, some will be discarded before the population mean has reached 
that level and others afterwards; and there will be a frequency distribution of states of 
wear, as measured by the population mean, at which different tools are discarded. It is the 
purpose of this paper to determine that frequency distribution and discuss its practical 
consequences. 

The argument will be developed in terms of the above example, but the results may be 
applied generally to any process having a trend in some characteristic that is estimated at 
regular intervals by any measure having a sampling error. 

The situation is shown diagrammatically in Fig. 1. The line A BC represents the change 
with time in the population mean, X, of the dimension. X is a characteristic of the tool as 
set up in the machine, and we may imagine it to be determined, in principle, by making and 
measuring over a very short interval of time surrounding each instant a very large number 
of articles, and calculating their mean. In the region AB the trend may be of any form, but 
in the region BC, which covers the range of the population dimension at which substantially 
all the tools are discarded, the trend is assumed to be a straight line represented by the 
equation 

X =d+ah, (1) 


where A is time measured in units of the intervals between the taking of successive samples, 
and d and a are constants. The constant d is chosen to be the level of sample means at which 
tools are discarded, so that h is measured from the time at which X = d. The instants at 
which samples are taken are 


(@—s), (@—s+1), (@—s+2), ..., (@-1), 0, (041), 


where @ is the interval between zero time and the first subsequent sampling instant, and 
8 is an integer defined below. 
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The sampling distribution of the mean is represented in Fig. 1 for the first three sampling 
instants, and P(—s), P(@—s+1), P(0—s+2) 
are the respective probabilities that the sample means exceed d. For any sampling instant 
(@—s+v) say, P(@—s+v) is the probability that a tool, surviving to that instant, will then 


be discarded. Let S(@—s+v) be the probability that any tool will survive to the instant 
(9@—s+v) and D(@—s+v) the probability that it will be discarded at that instant, so that 

D(@—s+v) = P(O—s+v) S(O—s+v). (2) 

We shall assume that for a population of tools, @ will have all values between 0 and 1 with 

equal probability and shall consider the elemental proportion d@ of those tools for which @ 

is within the limits 6 + $d. We shall also choose s such that P(@—s) = 0 to a sufficient degree 

of accuracy (i.e. that P(@—s)<e, where ¢ is a chosen small quantity), and P(@—s+1) is 


~—9 P(e-S+1) P(e-S+2) 


d ms : / x2d+ ah ra 
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time, fh. 
Fig. 1. 
the smallest probability of which account need be taken. Then, with the aid of equation (2), 


we obtain for any tool in the elemental proportion the following probabilities of survival to 
and discard at the successive sampling instants: 


Instant Probability of survival to instant Probability of discard at instant 
(O—s) S(0—s) = |] D(é—-s) = P(@-—s) S(@-s) z= 0 
(@-s+1) S(@—s+1)=S(6-s) —D(0-s) D(@—s+1) = P(@-—s+1)S(@-—s+1) 


(@-8s+2) S(@—s+2) = S(@—s+1)—D(@-—s+1) D(@—s+2) = P(O—s+2)S(O—s8+2) 


Thus, from the known probabilities P(6— 3s), etc., can be deduced the probabilities D(@—s), 
etc., and these multiplied by d@ are the elemental probabilities of the times of discard of 
tools, i.e. they are proportional to the ordinates of the probability distribution of the times 
of discard. , 

Probability distributions have been calculated for normal sampling distributions having 
a constant standard error (s.E.) given by the equation 


8.E. = ka, (3) 


where k = 1, 2,5, 10, 20, 33-3 and 50 respectively. The probabilities of the time of discard 
exceeding various values of h are given in Table 1; and a few useful constants, calculated 

















Table 1. Probability of time of discard of tool being later than h 


























6-0 | 0-0000 



































k=20 k=33-3 k=50 
| | 
h | Prob.| h | Prob h | Prob.| h | Prob.| A | Prob. h ieee h | Prob. 
| | j 
aa Set cnn Ga | F | | 
—78 | 1-0000 — 196-5 | 1-0000 
—38 | 1-0000 | —76 | 0-9999 | — 129-5 | 1-0000 | — 192-5 | 0-9998 
—3-4| 1-0000 | —6-8 | 1-0000 | — 18-5 | 1-0000 | — 37 | 0-9999 | —74 | 0-9997 | — 126-5 | 0-9997 | — 188-5 | 0-9994 
—3-2 | 0-9999 64 | 0-0908 — 18-0 | 0-9999 | —36 | 0-9997 —72| 0-9994 — 123-5 | 0-9994 | — 184-5 | 0-9990 
} | | 
—3-0 | 0-9997 | —6-0 | 0-9994 | — 17-5 | 0-9998 | —35 | 0-9995 | —70 | 0-9990 | — 120-5 | 0-9991 | — 180-5 | 0-9986 
—2-8 | 0-9993 | — 5-6 | 0-9987 | — 17-0 | 0-9997 | — 34 | 0-9992 | — 68 | 0-9985 | — 117-5 | 0-9985 | — 176-5 | 0-9978 
—2-6 | 0-9986 | —5-2 | 0-9973 | — 16-5 | 0-9995 | —33 | 0-9988 | — 66 | 0-9977 | — 114-5 | 0-9977 | — 172-5 | 0-9968 
—2-4 | 0-9974 | —4-8 | 0-9948 | — 16-0 | 0-9992 | —32 | 0-9982 | — 64 | 0-9965 | — 111-5 | 0-9967 | — 168-5 | 0-9955 
—2-2 | 0-9953 | —4-4 | 0-9905 | — 15-5 | 0-9988 | —31 | 6-9974 | — 62 | 0-9948 | — 108-5 | 0-9953 | — 164-5 | 0-9938 
| | | 
—2-0 | 0-9917 | — 4-0 | 0-9833 | — 15-0 | 0-9982 | —30 | 0-9963 | — 60 | 0-9926 — 105-5 | 0-9933 — 160-5 | 0-9915 
— 1-8 | 0-9859 | — 3-6 | 0-9717 | — 14-5 | 0-9974 | —29 | 0-9947 | —58 | 0-9893 | — 102-5 | 0-9906 | — 156-5 | 0-9885 
—1-6 | 0-9770 | —3-2 | 0-9539 | — 14-0 | 0-9963 | —28 | 0-9925 | — 56 | 0-9848 | — 99-5 | 0-9870 | — 152-5 | 0-9845 
— 1-4 | 0-9637 | — 2-8 | 0-9279 | — 13-5 | 0-9948 | — 27 | 0-9895 | —54 | 0-9788] — 96-5 | 0-9821 | — 148-5 | 0-9793 
—1-2 | 0-9445 | —2-4 | 0-8909 | — 13-0 | 0-9928 | — 26 | 0-9855 | —52 | 0-9708 ] — 93-5 | 0-9756 | — 144-5 | 0-9725 
| 
—1-0 | 0-9178 | —2-0 | 0-8407 | — 12-5 | 0-9901 | — 25 | 0-9802 | —50 | 0-9603 | — 90-5 | 0-9672 — 140-5 | 0-9641 
—0-8 | 0-8820 | — 1-6 | 0-7757 | — 12-0 | 0-9865 | — 24 | 0-9732 ] —48 | 0-9466 | — 87-5 | 0-9561 | — 136-5 | 0-9532 
—0-6 | 0-8359 | — 1-2 | 0-6954 | — 11-5 | 0-9819 | —23 | 0-9640 | —46 | 0-9288|-— 84-5 | 0-9420 | — 132-5 | 0-9396 
—0-4 | 0-7787 | —0-8 | 0-6016 | — 11-0 | 0-9759 | —22 | 0-9522 | —44 | 0-9062] — 81-5 | 0-9240 | — 128-5 | 0-9226 
—0-2 | 0-7105 | —0-4 | 0-4984 | — 10-5 | 0-9681 | —21 | 0-9372] —42| 08777] — 78:5 | 0-9015] — 124-5 | 0-9019 
0-0 | 0-6325| 0-0! 0-3921 | — 10-0 | 0-9582 | — 20 | 0-9182 | —40 | 0-8425] — 75-5 | 0-8735 | — 120-5 | 0-8764 
0-2 | 0-5473| 0-4 | 0-2904] — 9-5 | 0-9458 | — 19 | 0-8947 | —38 | 0-7999] — 72-5 | 0-8392 | — 116-5 | 0-8457 
0-40-4583] 0-8 | 0-2006] — 9-0 | 0-9305] — 18 | 0-8660 | —36 | 0-7492] — 69-5 | 0-7981 | — 112-5 | 0-8090 
0-6 | 0-3698] 1-2 | 0-1279| — 8-5| 0-9117] —17 | 0-8313 | —34 | 0-6902] — 66-5 | 0-7495 | — 108-5 | 0-7660 
0-8 | 0-2862| 1-6 | 0-0745 | — 8-0 | 0-8889] — 16 | 0-7901 | —32 | 0-6234) — 63-5 | 0-6934 | — 104-5 | 0-7163 
| 
1-0 | 0-2116 2-0 | 0-0392 — 7-5 | 0-8616 | —15 | 0-7423 | —30 | 0-5501 | — 60-5 | 0-6299 | — 100-5 | 0-6600 
1-2|0-1487| 2-2 | 0-0273| — 7-0 | 0-8293 | —14 | 0-6878 | —28 | 0-4722| — 57-5|0-5601| — 96-5 | 0-5978 
1-4 | 0-0988 2-4 | 0-0184| — 6-5 | 0-7918 | —13 | 0-6270 | —26 | 0-3923| — 54-5|0-4855] — 92-5 | 0-5304 
1-6 | 0-0618 | 2-6 | 0-0120 | — 6-0 | 0-7489 | — 12 | 0-5608 | —24| 0-3138] — 51-5 | 0-4085] — 88:5 | 0-4596 
1-8 | 0-0362 2-8 | 0-0076 — 5-5|0-7006 | —11 | 0-4907 | —22 | 0-2401] — 48-5] 0-3319] — 84-5 | 09873 
2-0 | 0-0197 3-0 | 0-0047 | — 5-0 | 0-6473 | — 10 | 0-4187] —20 | 0-1747] — 45-5 | 0-2589] — 80-5 | 0-3160 
2-1/0-0141| 3-2|0-0027] — 4-5/0-5896] — 9| 0-3472| —18 | 0-1200] — 42-5) 0-1928] — 76-5 | 0-2485 
2-2|0:0099] 3-4/0-0015| — 4-0| 05284] — 8 | 0-2787] —16 | 0-0772] — 39-5 | 0-1362| — 72-5 | 0-1873 
2-3 | 0-0068} 3-6 | 00008] — 3-5 | 0-4650] — 7 | 0-2157 —14 | 0-0462 — 36:5 | 0-0905] — &8-5 | 0-1346 
2-4 | 0-0046} 3-8 | 0-0004] — 3-00-4010] — 6 | 0-1602] — 12 | 0-0255] — 33-5 | 0-0562] — 64-5 | 0-0915 
} | | 
2-5|0-0030| 4-0 | 0-0002 | — 2-5 | 0-3381 — 5|0-1137] —10| 0-0127] — 30-5 | 0-0322] — 60-5 | 6 0585 
2-6|0-0019} 4-2 / 0-0001] — 2-0 | 0-2780] — 40-0767] — 8/|0-0057] — 27-5| 0-0169] — 56-5 | 0-0349 
2:7|0-0012] 4-4/|0-0000] — 1-5 | 0-2223] — 3/0-0490] — 6/0-0023] — 24-5/0-0080| — 52-5 | 0-0192 
2:8 | 0-0007 — 1-00-1725] — 2| 0-0295] — 4/0-0007] — 21-5 | 0-0034] — 48-5 | 0-0097 
eo | — 0-5 | 0-1295] — 1) 0-0166] — 2/|0-0001] — 18-5|0-0012] — 44-5 | 0-0044 
| } | | 
3-0 | 0-0002 | 0-0 | 0-0938} 0/0-0087}  0/0-0000}] — 15-5} 0-0003}] — 40-5 | 0-0017 
3-1 | 0-0001 | 0-5 |0-0654]  1/0-0043 | — 12:5] 0-0000] — 36-5 | 0-0006 
3-2 | 0-0000 1-0 | 00437] 2 | 0-0020 — 32:5 | 0-0001 
| 1:5|0-0279]  3|0-0008 | — 28-5 | 0-0000 
| 2-0 | 0-0170 4 | 0-0003 | 
| } | 
2:5|0-0098]  5|0-0001 
| 3-0 | 0-0054| 6 | 0-0000 
3-5 | 0-0028 | 
4-0 | 0-0014 | | 
| | i Ie 
| | 
5:0 | 0-0003 | 
| | 5-5 | 0-0001 | 
| 
| 
i 
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from a fuller form of the distributions than that given in Table 1, are shown in Table 2. 
These constants involve the following quantities: 

h, the mean time of discard, 

hoo, and hyo; the times of discard exceeded with probabilities of 0-01 and 0-05 respec- 
tively, and 

Xoo, and Xoo; the population values of the dimension at the corresponding times, related 
to the values of h by equation (1). 

Values from Table 2 are plotted against k in Figs. 1 and 2 to facilitate interpolation for 
intermediate values of kt. 














Table 2 
k 1 2 5 10 20 33-3 50 

Mean time of discard, h 0-27 | —0-49 | —4-01 |—i1-61 | —29-68 | —56-68 | —93-16 
Time exceeded with probability 0-01: 

ho-o1* 220 | 268 | 250| —0-22 | -—9-38 | —25-33 | —48-66 

(ho-o1 —) 1-93 3°17 6-51 11-39 20-30 31-35 44-50 

(d— Xo.91)/8.E. —2-20 | —1-34 | —0-50 0-02 0-47 0-76 0-97 
Time exceeded with probability 0-05: 

he-es* 1-68 1-86 0-84 | —3-04 | —14-:29 | —32-82 | —59-21 

(ho-os—F) 1-41 | 235 | 485] 857 | 15:39 | 23-86 | 33-95 

(d—Xo-05)/8-E. -1-68 | —0-93 | -0-17 | 0-30 0-71 0-98 1-18 
































* These are also (Xo.9,—d)/a and (Xp.9,—d)/a according to equation (1). 


APPLICATIONS 


The results of the previous section will usually be applied to control the upper limit of X at 
which tools are discarded, for it is then that defective articles begin to be produced. The 
theory does not allow the specification of an absolute upper limit of X beyond which no tools 
will be used—the assumption of normality allows the theoretical possibility of tools being 
used up to the stage at which X approaches infinity. It is possible only to specify an upper 
limit of X that will be exceeded with some probability, and in the following discussion we 
shall take this probability to be 0-01 considering Xyo, to be the limit of X and ho, as the 
latest time that tools will be discarded. 

First, we may calculate Xo., for any chosen level of discard. The practice in ordinary 
quality control of setting control limits at 1-96 s.z. above and below the control levelt 
suggests, on the face of it, that the level of discard, d, should be at sample means 1-96 s.z. 
below the upper tolerance limit LZ given in the specification of this article. Then 


d = L—1-96s.8., 


and if we subtract d — X99, from both sides we have 
iu d—Xoo 


+ The values in Tables 1 and 2 may have errors of 1 or, occasionally, 2 in the last figure. 
} See Table 10 of pamphlet B.S. 600R, entitled, Quality Control Charts, by B. P. Dudding and W. J. Jennett, 
published by the British Standards Institution. The inner limits are referred to here. 
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From Table 2 we see that, if k = 1, 
Xoo — L+0:24s.5. 


Thus, more than one tool in a hundred are in use after they are producing articles with a 
population mean dimension above the tolerance limit, and hence with above 50 % defectives. 
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(d — X.9,) +8.E. or (d— X.95) +8.E. 
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Fig. 2. 


The situation is better if the rate of wear is greater compared with the standard error of 
sample means, so that k is increased. Thus, if k = 50, 


Xoo, = L-- 2-938... 


Now, in order to determine the percentage of defectives produced just before the time of 
discard, let us suppose that the sample size is 16 and that the standard deviation of individual 
articles made at any one time iso, so that 


Then, for k = 50, Xoo, = L—0-730, 
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and if the frequency distribution of individual articles made at any one time is normal, the 
percentage of defectives at the above value of Xyo, is about 22 (Table 9 of B.S. 600 R). This 
example is sufficient to show that a level of discard related to the tolerance limit in terms of 
only the standard error of the sample mean is unsatisfactory. The rate of change of X with 
time and the size of the sample must also be considered. 

Usually, it will be more convenient to choose X_, and use Table 2 to determine the 
appropriate level of discard. X,.,; may be chosen in relation to the engineering limits so 
that at that value some given percentage of defective articles is produced. The following 
fictitious example illustrates the procedure. 


Example I. Suppose that an article is being produced to a specified diameter with an upper 
tolerance limit of 6-30 in., that a preliminary investigation has shown that the frequency 
distribution of articles produced on one machine at one time is substantially normal with a 
standard deviation of o = 0-057 in., that the variability is in control, that tool wear and 
similar factors cause an average increase in diameter of a = 0-0005 in. per hour in the 
neighbourhood of the time of retooling and resetting, and that eight articles are measured for 
diameter every hour. At what sample mean diameter, d, must the tool be discarded to ensure 
that only one tool in a hundred ever produces more than 10 % defective articles through 
being oversize? What is the average tool life compared with the life that would be attained 
if all tools could be discarded just when they begin to produce 10 % defective articles? 

Table 9A of B.S. 600R shows that the value of ¢, the standardized deviation of the 
variable distributed normally, corresponding to 10% defectives is 1-2816, so that the 
corresponding value of X is 6-30 — 1-2816 x 0-057 = 6-227 in.; this is the chosen X,,,. For 
sample of eight, s.z. = 0-0201 in. and k = 40; and from Fig. 2 we find that 


o- Fett 0-84, 
S.E. 
whence d = 0-0201 x 0-84+4 6-227 = 6-244 in. 


Also we find from Fig. 3 that (ho9,—) = 37, so that the mean life of the tools is 37 hr. less 
than it would have been had it been possible to use them all until they produced 10% 
defectives. 

It should be noted that most of the tools (99 % of them) will never produce as many as 
10 % defective articles, and that some (1 % of them) will produce more; for most of the time 
all tools will produce many fewer than 10 % defectives. Hence, if articles from a number of 
tools in different stages of wear can be bulked, the average percentage of defectives will be 
very low. This average can be determined if the (X, h) curve is known throughout the whole 
of the life of the tool. 

The level of discard can be worked out for an Xo, corresponding to any other limit of 
allowed percentage of defectives, or for a probability level other than 0-01 with which that 
limit is exceeded. The relevant quantities for a level of 0-05 are given in Table 2 and Figs. 
1 and 2, and for other levels they may be obtained from Table 1. The choice of these quan- 
tities depends on technical conditions and requirements, and no guiding rules can be given. 
In practice it will be difficult to make a decision based on measured quantities, and a certain 
amount of guesswork and arbitrary choice will probably be involved. It would be easy, 
but laborious, to calculate the mean percentage of defective articles produced by all tools 


immediately before discard, for various levels of discard, and to use this in choosing the ' 


level; but it is doubtful if such a choice would be any easier to make in practice. 
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The above application considers the approach of articles to the upper tolerance limit only; 
the lower limit will determine the original setting of the tool in the machine. If the trend is 
one of decreasing dimension, the signs of all the quantities are reversed and the approach 
to the lower tolerance limit is controlled. The procedure does nothing towards controlling 
erratic fluctuations. 


Example II. This example concerns the times to melt successive casts of steel in an open- 
hearth furnace, and the data are taken from Statistical Methods in Industry,* Fig. 7 and 
Table XX. In the figure, the melting times for successive casts are plotted against the cast 
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Fig. 3. 


number, and for A furnace there is a substantial random fluctuation superimposed on a 
secular movement which appears to be roughly periodic, rising to peaks at intervals of 
40-50 casts. A secular variation of this sort could be due to the gradual obstruction of the 
passages for the gases, which are cleared at intervals. The random variations could be due 
to such things as variations from cast to cast in the material charged into the furnace. It is 
not known whether these factors explain the fluctuations for A furnace, but for the sake of 
argument let us suppose that it is so, and let us derive a procedure for deciding when to clean 
the flues. The data are inadequate to justify this supposition and can provide only very 


* Published by the Iron and Steel Industrial Research Council of the British Iron and Steel Federation. 
Biometrika 33 It 
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rough estimates of the various quantities; but for the purposes of illustration this does not 
matter. 

Ideally, we would describe the state of the furnace by the time it takes to melt an average 
cast, undisturbed by random fluctuations; this is our unknown population value, X. The 
standard deviation of the random fluctuations in the actual times about X is the s.£., and 
may be roughly estimated from the mean range between consecutive pairs of times. In this 
way, S.E. is estimated to be 1-17 hr. A rough estimate of a, the rate of increase in melting 
time, obtained from the figure is 0-16 hr. per cast, so that 


(fd is the cast number), and from Fig. 2 we find that 
(d —_ Xo01)/S-E. = 0-23, 
whence (d — Xgo,) = — 0-27 hr. 


Suppose it is decided that X,., shall be 14 hr.; then d = 13-7 hr. From Fig. 3 we find that 


(hoo: —h) = 7°5 casts, and the average melting time immediately before the furnace is 
cleaned is 14—7-5 x 0-16 = 12-8 hr. 


There are some conditions to be satisfied before the results of this paper can be applied. 
The assumption of the constancy of the standard error of sample means involves the assump- 
tion that the variability is in control. The mean dimension need not be in control, except 
for the trend, but it is better that it should be so. Otherwise the variability due to uncon- 
trolled random variations in X must be added to the sampling variations when calculating 
the s.z. The question of control of X does not arise in circumstances like those envisaged in 
Example II, where the s.£. is due entirely to random variations in X. Another condition is 
that a and s.£. are known and that the sampling variation of Z, the sample means, is normal. 
All these conditions require a preliminary investigation. 


OPTIMUM CONDITIONS FOR CONTROL 


It will be interesting to see whether it is better to take large samples at infrequent intervals 
or small ones at correspondingly more frequent intervals, the total number of tests over a 
given length of time remaining constant. For the probability level of 0-01, it is immaterial 
whether the intervals are frequent or infrequent provided that 
(hoor —h) = okt, 

where @ is a constant. For if the interval between the samples is increased from 1 to r units 
of time and the sample size is increased to r times its original value, the new standard error 
is s.E.’ = S.E./,/r, the rate of deterioration in terms of the new units of time is a’ = ra, the 
new k becomes k’ = k/r?, and the new (hy.,; —h) becomes ak’? = ak/r in the new units, or 
ak in the original units of time, which is the same as the original value of (hyo; —h). When 
the values of (ho9,—h) in Table 2 are plotted against k#, they are found to lie on a curve 
starting near the origin and concave to the (hyo, —h) axis; i.e. (hgg,;—h) increases more 
rapidly than in proportion to &, and it is better to have a large value of r so as to keep k 
small. The same is true of (hgo,—h). Hence, within the limits of this investigation, for a 
given total number of articles tested in a given total time, it is better, from the point of view 
of keeping the average tool life as near as possible to the chosen limiting life, to take large 
samples at relatively infrequent intervals than small samples at frequent intervals. 
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We may illustrate this by Example I. If samples of thirty-two articles are taken at 
4-hourly intervals, 
@ = 4x 0-0005 = 0-0020 in. per interval, 


0-057 
= 32, = 0-101 in 9 
k = 5-05, 
and from Table II, (hoo: — 2) = 6-5 intervals 


= 26 hr. (approx.). 
This may be compared with the 37 hr. for hourly samples of eight articles. 

Another method of improving control that suggests itself is to combine each sample mean 
with the few previous means, using the assumption of a linear trend to obtain an improved 
estimate of X at the instant when the last sample is taken. 

Let the number of means combined in this way be m, and let them be designated 


Meee 


Z,, being the last in the series. Then it is easy to show that if a straight line is fitted to these 
values by the method of least squares, the improved estimate of X at the last sampling 
instant, %,, say, is given by the equation 
z, ys 1)z 4 
on Hit) )Zy (4) 
and the standard error of this estimate, s.£.,, say, by the equation 


2(2m — 1) 


S.E.,, = /———— x 
7 m(m +1) 


S.E. (5) 


Table 3 shows how the precision of the estimate increases with m. 








Table 3 
m S.E.,,/S.E m S.E.,,/S.E 
1 1-00 5 0-77 
2 1-00 6 0-72 
3 0-91 8 0-65 
4 0-84 10 0-59 


























It is wrong, of course, to combine so many means extending over such a length of time 
that the departure of the (X,h) curve from linearity for that time is appreciable compared 
with the sampling errors. 

Suppose in Example I the hourly results are combined in this way in fours. Then 

S.E.,, = 0-84 x 00201 = 0-0169 in., 
k = 33:8, . 
and (hoo: —h) = 31 hr. (approx.). 
For this example, the concentration of the thirty-two tests at 4-hourly intervals, giving 


(hoo: —h) = 26 hr., remains the best arrangement. Sometimes, however, it is better to test 
more frequently and combine means by this second method. For example, if with hourly 
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tests, k = 31-62, then (hoo, —h) = 30 hr. If ten times as many tests are taken at 10-hourly 
intervals, k becomes 1-0 and (hoo, —h) = 10 x 2-20 = 22 hr. If the hourly tests are combined 
in tens, k becomes 0-59 x 31-62 = 18-6, and (hyo; —) = 19 hr. This second method of com- 
bination is the only one available in cases like that of Example II. 


SUMMARY 


1. If samples of a product deteriorating linearly with time are taken at intervals, and 
action is taken when the sample values reach a certain level, such action will be taken, 
sometimes before and sometimes after the population value has reached that level. 

2. Frequency distributions are given for the times at which action is taken. These dis- 
tributions depend on the standard error of the sample mean, the size of the sample and the 
rate of deterioration of the product. 

3. A procedure is given for deciding when to take action. 

4. For a given number of tests, it is better to take large samples infrequently than small 
ones frequently. A method is given for combining the results of several consecutive samples. 
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STUDENTIZATION 


OR THE ELIMINATION OF THE STANDARD DEVIATION OF THE PARENT 
POPULATION FROM THE RANDOM SAMPLE-DISTRIBUTION OF STATISTICS 


By H. 0. HARTLEY 


1. INTRODUCTION 


In this paper we give a mathematical solution of a problem in statistical distribution, some- 
times referred to as ‘Studentization’. This process is best described by recalling briefly the 
properties of two well known examples: 

Let 2}, ...,%, denote a sample of n observations drawn from a normal distribution with 
mean 4 and standard deviationo. Then the difference between sample and population mean, 
—y, is normally distributed with zero mean and standard deviation o/,/n. If we estimate 
the standard deviation of the parent, a, by 


8 = \[2(x—2)*/(n—1)], 


then the ratio t = (¥—p)./n/s 


is distributed according to Student’s ¢-distribution which does not depend ona. 
Consider now a second, independent sample X,, ..., X,, drawn from the same population. 
The random sample distribution of its standard deviation 


s* = J[3(X—X)?/(m—1)] 


follows a well-known probability law which, again, involves the value of c. If, again, we 
estimate this value ofo by using the standard deviation, s, of the first sample as its estimate, 
then the distribution of the ratio 

JF = 8*/s 


is independent ofc. This distribution is, of course, a transformation of Fisher’s z-distribution. 

It will be noted that the original statistics, ¥— and s*, as well as s, are proportional too 
in the sense that if the observations x; and X; are measured in units of¢ the above statistics 
are, by this change of scale, transformed into (—)/o, s*/o and s/o respectively, and their 
parental distribution is transformed into the normal distribution with unit standard devia- 
tion. Hence the random sample distributions of the ratios 

is Pe FR Pt 
ME A OG 2 Sore: g 
do not involve the value ofc, nor is it necessary to know this value ofo in order to calculate 
the above ratios. The latter may be called the studentized statistics corresponding to the 
original statistics 7—y and s*. 

This simple studentization process of dividing a statistic by an independent estimate s of 
the parental standard deviationg may be applied to a number of other useful statistics which 
are also proportional to 7 in the above sense. We may mention here the range in a sample, 
the extreme observation, the extreme value in a sample of random standard deviations, 
but there are many others. 

It is the object of this paper to derive a general formula for the studentized distribution 
law of such statistics. The accuracy of the formula will be demonstrated with a numerical 
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example and a new formula for the incomplete Beta function derived in the process. This 
result has already been used in recent numerical work on this function (Thompson 1941). 
A second application of the method may be found in a paper by Pearson & Hartley (1943), 
and it is hoped to publish others. 


Finally, computational methods for the systematic tabulation of studentized distribution 
laws are briefly outlined. 


2. DEFINITIONS AND NOTATION 
A problem akin to the present one has been considered in a previous paper (Hartley, 1938), 
but to fix ideas we shall repeat here certain definitions and results given there. 

Let x,, ...,2, be a sample of n observations drawn from a normal population with standard 
deviationo. Consider a general statistic W, calculated from this sample, having the following 
properties: 

(a) W is positive. 

(6) W is ‘proportional toc’ in the sense that if the observations x; are measured in units 
of o the statistic W is thereby transformed into W/c. 

Let f(W) be the distribution function of W for the case where is known to be unity, and 


Ww 
let P(W) be the corresponding probability integral, ie. P(W) -| J(W)dW. Also let s be 
0 


an independent estimate of the standard deviation based on n degrees of freedom. Finally, 
let} S = Jjns = .,/(Sum of squares) and r= W/S. 


Further denote by f,,(r) the random sample distribution of r and by p,,(R) its probability 
integral, i.e. R 
P,(R) ~ 4 f(r) dr. 


Using this notation we proved (see Hartley, 1938) that 
p,(R) = I'(4n)* sien" gete P(SR)dS8. (1) 
0 


This equation gives p,(R), the ‘studentized integral’, formally in terms of the ‘oo-integral’ 
P(W). We also proved the recurrence formula 


R 
Pn_o(R) = (n—2) R0-® | Dalr)r"-Sdr, (2) 
0 


with the help of which the degrees of freedom of p,(R) may be reduced by 2. 


3. PROPERTIES OF THE RECURRENCE FORMULA 


The series of studentized integrals p,(R) is uniquely determined by the recurrence formula 
(2) and the relations 


. W . ; 

lim p,, (=) = P(W) (uniformly in W), p,, bounded, (3) 

ao \Vn ; 

which follow from an inequality derived previously (see Hartley, 1938, (10)) and from (2). 

To prove this suppose there were two series of piecewise continuous functions p,(R) and 
t It will be seen that this condition is not essential for our results. Also, if the distribution function of W is 

symmetrical about 0, condition (a) will be satisfied for the statistic |W| which may be used in place of W. 


} It was necessary to alter the notation used previously (Hartley, 1938) writing S in place of p and r in place 


of g, in order to fall in with the notation used by other writers (e.g. Newman, 1939) who reserve q for the ratio 
W/s. 


m,(R) 
for 0< 


Put 
the 
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m,(R) both satisfying (2) and (3), and let us denote by 4,, the maximum of | p,(R)—7,,(R) | 
for 0< R<oo. From (3) we obtain , 
lim A,, = 0. (4) 


n—>o 


On the other hand, we find from equation (2) that 
R 
Pn-a(B)—My_a(R) = (n= 2) R-*-®| Tp,(r)—a(r)] *-Adr, 
0 


from which we obtain A,-2<(n—2) RA, =a = A,, 
i.e. A,-2<A,. (5) 
From (4) and (5) it follows that 
A, =90 
for all n. 

The recurrence formula (2) may be used to produce by quadrature p,,_,(R) from p,(R). 
The difficulty is that none of the integrals p,,(R) is known so that we have no starting point. 
All we know is that W 

lim p, (=,) = P(W). 
n—>o Vn 
This suggests that a first and main step of the process will consist in bridging the gap between 
n = 00 (i.e. P(W)) and moderate values of n (i.e. p,,(R)), say, covering the range between 10 
and 50. This step will be accomplished with the help of the theory of partial differential 
equations in the next section. 


4, THE PARTIAL DIFFERENTIAL EQUATION OF p,(R) 


We may write formula (2) in the form 
R 
Repy(R) = | Paral) ™Adr. (6) 


Differentiating with regard to R we obtain 





d 
RAS —n Puse(R)—Py(B)] = 0. (7) 


We now regard the degrees of freedom as a continuous second variable capable of attaining 
positive values and try to find a function of two variables p(n, R) for which the 


relations 
Op(n, R 
RPO) _ nipin+2, R)—pln, R)] = 0 (8) 
W P 
7) = PW 
and Jim p(n, =) PUW) (9) 


hold. Again this function p(n, R) is uniquely determined by the relations (8) and (9). This is 
easily seen from integrating (8) which yields . 


R 
R*p(n, R) = nf p(n + 2,r)r™— dr +9(n). 
0 


Putting R = 0 we see that g(n) =0, ie. that equation (6) holds. The remainder follows by 
the argument given in § 3. 
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To convert equation (8) into a partial differential equation we expand the finite difference 
in a Taylor series and write 


aR "| 7 an + 7 dn2t Sant Bont | (10) 


where p is taken at arguments R and n whilst > is taken at R and some mean value between 
n and n+ 2. 


We now introduce as new variables 


2 3 a7 
9°? 1 OP. 40%p a | 0, 


=logR and 2x =logn. 
The partial differential equation is thereby transformed into 
Op ,0p_ , (0p Op\ , (8p 0% ) 
Se ** (2-2) ae = (55 -35+ +35. 
op oD op Op 
2e-3ej_f _ 
+2e (3 aa tilss— oe), (11) 


where the arguments of p are y and x and those of ) are y and some mean value x between 
x and log (e* + 2). 


5. THE SOLUTION OF THE PARTIAL DIFFERENTIAL EQUATION BY ITERATION 
The partial differential equation (11) cannot, of course, be solved by standard methods. 


However, it is possible to find an approximate solution for large and moderate values of x 
with the help of an iteration process. For xo the equation (11) tends to the limiting form 


a ag =O (12) 
the general solution of which is given by 


Polx,y) = (y+ 32), 
where ¢(A) is an arbitrary differentiable function of A. In order to satisfy condition (9) we 
have to put (A) — P(e’), (13) 


so that we obtain for the solution p,(z, y) 


Polx,y) = P(er*#). (14) 

The first approximation p,(x,y) to the studentized integralf is therefore a function of 

(y+4a) only ie., expressed in terms of R and n, py depends on R./n only. This means 
that for large m the studentized integral is, to a first approximation, independent of n. 

In order to work out closer approximations we have to solve the non-homogeneous partial 

differential equation 


0 
ete = y.2). (15) 


A partial solution of this equation can be found with the help of the theory of characteristics 
and is given by y 
P(x, y) = |v, — 2p + 2y +x) dp, (16) 


+ We shall confine ourselves here to working out the first three steps of the iteration process. It would lead 
us too far afield if we would give here the proof for its convergence. The check on the accuracy of the approxima- 
tion is discussed in the last section. 


usi 


nce 


(10) 


een 


een 
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as is easily verified by differentiation. The general integral of equation (15) is therefore 

given by y 

p(z,y) -| ¥(p, —2p+ 2y+x)dp+(y+ 42), (17) 
Ve 


where ¢, yp are arbitrary and p is a variable of integration. If the function y(y, z) is of the 

special form N 

Wy,2) = Dew yly+ 42), (18) 

and if we put y)= —oo we find by substituting (18) in (17) and by integrating each term 
p(x, y) = Ply + 32) + 3S — y(4e+ 9). (19) 


The second approximation (p,(z, y)) to the solution of (11) is now obtained by solving 





Op, Op, e-2 O*py Og 
ey a ™ (- 2). _ 
using po(z,y) = $(y+ 4a). From (15), (18) and (19) we obtain 
P(x, y) = (y+ 32) + e739" — 39’), (21) 


where we denote by ’, ”, ”, etc., the order of the derivative of the function ¢ with regard to 
its argument. The function p,(z, y) is a solution of the equation (11) if all terms involving 
e~**, e-*, etc., are ignored. 
The third approximation p,(z, y) is now obtained as a solution of the equation 
pe Ps _ de (Et Ps) + gee (SE-aPt4 2), (22) 


where, in the right-hand side, we may ignore all terms involving e~**, e—**, etc. From equa- 
tions (18) and (22) we obtain for the solution (19) 


Polt,y) = P+e-*(2h" — 34") + “(ge — 2a9" + 89" —69'), (23) 


which is a solution of (11) ifall terms involving e~**, e—**, etc. “ie 1/n*, 1/n*, etc.), are ignored. 
For samples of the order 20 or larger this formula will in most cases give quite sufficient 
accuracy. For smaller samples or high accuracy the term involving e~* will be required. 
We find by a further iteration 


ee : « » 
Pal®,Y) = Pal®,¥) + Fy (seO"! — E89" + 7 OY — 7S" + 39’). (24) 
Let us write formula (23) in terms of the statistic 
R Jn = elv+42)_ 


We have the following relations between the function ¢(A) and the oo-integral P(W) 


(A = y+ 42) 
¢ = Pee’), 


¢’ = P’e',z 
go” = Pe + P’ed, 
Go” = P™e*+3P*e*+ Pe’, 
dv = Pivet + 6 Pe + 7P"e* + P'e’. 











178 Studentization 


Substituting in equation (23) we obtain the chance p,,(R) for the statistic r = W/s./n to fall 
below R which is identical with the chance for the studentized statistic W/s to fall below 
R.Jn = Q (say). We obtain from the second approximation, i.e. from formula (23) 


1 1 ” ’ 
pa(R) = P(Q)+ 7 (Q*P"— QP’) +75 5(4GP*—3QP"-3QP"+}QP'), (25) 

where the argument of P and its derivatives is Q, since 

d(y + 4x) = P(ert#*) = P(R.n) = P(Q). 
With equations (25) and (24) we have reached the solution of the studentization which is 
of sufficient accuracy for most practical purposes. In the following sections we shall show 
with an example how these equations may be used in practical instances. We shall see that 
the formulae lend themselves very well to the computation of p,(R). For a fixed value of Q 
Pr(Q/./n) is a quadratic in 1/n if equation (25) is used. The formula is of a fair accuracy even 


for small values of n. However, if higher accuracy is required it is preferable to include the 
term involving 1/n* which is given by equation (24). 


6. A NEW FORMULA FOR THE INCOMPLETE B-FUNCTIONT 
As is well known the incomplete B-function, the probability integral of Fisher’s z and of 
Snedecor’s F, are all transformations of the same probability law which is the studentized 
integral of a sample standard deviation s,. If s? and s3 are both estimates of o? based on n, 
and n, degrees of freedom respectively, then the probability that the ratio s?/s3 falls below 
a a by I,,(4%), $n), 
where J,, is the incomplete B-function and 
_ mF 
0 gt, F* 





(26) 


Whilst for large p and q there are a number of useful methods facilitating the approximate 
evaluation of I,( p,q) (see e.g. Soper, 1921 and Wishart, 1927) there appears to be a lack of 
suitable methods if p is small or moderate and q is large. This is a case where our method 
of studentization may be usefully employed. 

The distribution of s, for n, degrees of freedom is given by 


f(s) = (hn) 2 np sp tetas. (27) 
All we have to do therefore is to substitute in equation (25) f(s,) for P’ and the derivatives 


of f(s,) for P”, P” and Pv respectively. The argument Q is to be replaced by the square root 


of the variance ratio F and n by ng, whilst P itself is the probability integral of ,/(x?/n,) for 
n, degrees of freedom. 


We find without difficulty 


Palm) = PP) + URS — VEN + peg APY" -EPY"-BES'+ AVES), (28) 


where f is given by (27) and the functions f’, f” and f” are its derivatives whilst the argument 
8, is replaced by ./F. The integral P* is the probability integral of s, or that of ./(x?/n,). 


t Since this paper was written my attention has been drawn to a paper by A. G. Campbell (1923) in which 


a similar expression has been derived. Campbe!!’s formula is mentioned and used in the recent book by Simon 
(1941). 
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It is most convenient to adopt the notation which has been introduced by Karl Pearson 
in his work on the incomplete B-function. The integral p,, will then become an approximation 
to the function I, (p,q) where 2p = n,, 2q = n, and 2, is given by (26). If, further, 





en _ 9% 
w = $Fn, = = (29) 
we obtain from equations (27) and (28) the new formula for the function L,( p,q), viz. 
I,,(p,4) = P*(VF) + te-°w?q* I'(p) (p— 1—@) 
+ few? gq * Ip) {3 p* — Sp + Sp — 3 —o(Sp* — 4p + 3) +@*(Sp—%)— 30%}, (30) 


where P*(,/F) is the probability integral of ./(x?/n,) for 2p degrees of freedom. 

The formula is of considerable help for large values of g and small or moderate values of p. 
If p and w are kept constant I, (p,q) is seen to be a quadratic in 1/q. 

To give some idea of the remarkable accuracy of the formula we consider the example 
p = 1, = 5-555 and g = 5(1) 10(2-5) 25(5) 50. A comparison of accurate value and approxi- 
mation is given in the table below. The method has been used to calculate values of the 
incomplete B-function for g = 120 and small values of p. These were required for the new 
table of percentage levels of I,.(p,q) prepared by Biometrika (Thompson, 1941). 
































| I{1, 9) | I{1,9) 
2g | 2g | 
Exact | Approx | Exact Approx. 
| 
100 09948462 | 0-994 8452 30 0991140 | 0-991093 
90 6916 6901 25 0-989 922 0-989 830 
80 4052 | 4930 20 0-987 95 0-987 77 
70 2374 2342 is | 0-98679 0-986 55 
60 0-993 8847. |  0-9938795 16 6| 098529 0-984 94 
50 374 365 14 | 0-983 26 | 0-982 73 
| 45 022 010 12 | 0-98041 | 0-979 54 
40 0-992 572 | 0-992 553 10 0-976 26 | 0-974 63 
35 | 0-991973 |  0-991943 





7. SOME REMARKS ON THE TABULATION OF ‘STUDENTIZED INTEGRALS’ 


Formulae (24) and (25) are really the first three or four terms of an expansion of the student- 
ized integral in powers of 1/n, and it is necessary to know something about the accuracy of 
this approximation. The neglected remainder of the expansion depends on the derivatives 
of the oo-integral P(W) and it is difficult to reach a general formula which may be used as 
a convenient gauge for the estimation of its magnitude. Nevertheless, it will be possible to 
control, numerically, the accuracy of formula (25). Thus, if it is desired to tabulate the 
studentized integral to a certain decimal accuracy (say 3 or 4 or 5 decimal accuracy) one would 
proceed on the following lines: 

(i) From the magnitudes of the coefficients of n®, n-1, n-* and n-* in equations (25) and 
(24) estimate roughly the smallest value of n (n’ say) for which equation (25) represents 
p,(£) sufficiently accurately for large values of 2. 

(ii) For two or three values of n in the neighbourhood of the above value »’ and paired 
with two or three values of R, calculate the exact studentized integral p,(R) from 
equation (1) by numerica! quadrature and compare the result with the value of p,,(R) given 
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by (25). This comparison should fiz the smallest value of n (n, say) for which the approximate 
formula (25) yields the desired decimal accuracy. 

(iii) For n>n +2 use formula (25) for the tabulation of p,(R). For n<n,+2 use the 
recurrence formula (2) (starting from )+2 and n,+ 1 respectively) to produce in turn the 
exact p,(R) by numerical quadrature} reducing the degrees of freedom by 2 in each step 
of the recurrence. The first step of this process should produce the studentized integral 
Pp, (R) and this should agree identically with the one calculated from formula (25). 

Finally, we should mention a simple transformation of the recurrence formula (2) which 
makes it possible to carry out the quadrature (iii) with the help of a mechanical integrator 
known as the ‘ Differential Analyser’. A good description of this machine is given by Hartree 
(1935). We first transform the R-scale by introducing p = R-*. We then replace the function 
Pr(R) = p,(p~*) by 

7,(p) = [1—p,(e~*)]o-™. (31) 
The recurrence formula (2) is thereby transformed into 


m,(P) =n | “tysale)dp, (32) 


Thus 7,,(p) is seen to be a multiple of the integral of 7,,,,.() and the recurrence process is 
reduced to ordinary integration. 


This process is easily carried out on the Differential Analyser. For instance, with a capacity 
of six ‘integrator units’, six ‘output-tables’ and one ‘input table’, six integrations may be 


carried out simultaneously and 7,,_», 7,4; T,~6: ™n—» M19 and 77,1. produced from 7,,,(p) 
in one ‘run’. . 


REFERENCES 


CaMPBELL, G. A. (1923). Bell. Syst. Tech. J. January, 1923. 

Hart ey, H. O. (1938). J.R. Statist. Soc. Suppl. 5, 80. 

HartTREE, D. R. (1935) Nature, Lond., 135, 940. 

Newman, D. (1939). Biometrika, 31, 20. 

Pearson, E. 8S. & Hartiey, H. O. (1943). Biometrika, 33, 89. 

Smon, L. E. (1941). An Engineer’s Manual of Statistical Methods. New York: 
John Wiley and Sons, Inc. 

Soper, H. E. (1921). Tracts for Computers, No. vu. Camb. Univ. Press. 

Tuompson, C. M. (1941). Biometrika, 32, 151. 

WisuHakt, J. (1927). Biometrika, 19, 1. 


+ Note that with the recurrence process (2) only one numerical quadrature is required to produce p,_,(R) for 


all values of R, whilst the exact formula (1) necessitates a separate quadrature over the range 0 to for every 
single value of R. 
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MISCELLANEA 


Note on the use of the tables of percentage points of the incomplete beta function 
to calculate small sample confidence intervals for a binomial p 


By HENRY SCHEFFE, Princeton University 


One of many once tedious statistical computations which may now be made simply and directly by 
the use of Miss Thompson’s recent tables (1941) is the calculation of confidence intervals for a binomial 
p when the sample size is small, so that one hesitates to use the normal approximation to the binomial 
distribution. The problem was treated by C. J. Clopper & E. S. Pearson (1934). In their article they 
included charts yielding confidence intervals for e = 0-01 and 0°05, where 1—e denotes the confidence 
coefficient. With the new tables these cases as well as ¢ = 0-02, 0-1, 0-2, 0-5 are easily handled. 

Let x be the number of ‘successes’ observed in a sample of n trials on a binomial population for which 
E(xz/n) = p. Denote by p,(x)SpSp,(x) a confidence interval for p with confidence coefficient 1—e. 
From the work of Clopper & Pearson we find that p,(x) is determined by the equation 


2 Crk —P,)"%= fe (x<n), (1) 
I= 


while p,(n) = 1, and that p,(x) may then be found from 
P(x) = 1—p,(n—=). (2) 


Karl Pearson (1924) showed how the left member of (1) may be evaluated in terms of the incomplete 
Beta function, ; 


E Op (lp) =1,_,(n—2, 241) (e<n). (3) 
j=0 


From (1), (2), (3) we deduce the following rule for calculating the confidence limits p,(x) and p,(zx) 
from Miss Thompson’s tables: Enter the table for the 100(4¢) percentage point with v, = 2(n—2x+1), 
v,= 2x to read p,(x) directly; in the same table subtract from unity the entry for v, = 2(x+1), 
V, = 2(n—2) to get p(x). The exceptions to this rule occur for p,(0) and p,(n), to which we assign the 
values 0 and I, respectively. If the percentage point for the desired »,, v, is not tabulated, Hartley’s 
Methods of Interpolation preceding the tables will yield it fairly quickly, especially since only two 
decimals will be wanted ordinarily. 
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